снартев 3

THE ELECTROMAGNETIC FIELD

The rotating armatures of every generator and every motor in this age of electricity are steadily proclaiming the truth of the relativity theory to all who have ears to hear.
LEIGH PAGE (1941)

§3.1. THE LORENTZ FORCE AND THE ELECTROMAGNETIC FIELD TENSOR

At the opposite extreme from an impulsive change of momentum in a collision (the last topic of Chapter 2) is the gradual change in the momentum of a charged particle under the action of electric and magnetic forces (the topic treated here).
Let electric and magnetic fields act on a system of charged particles. The accelerations of the particles reveal the electric and magnetic field strengths. In other words, the Lorentz force law, plus measurements on the components of acceleration of test particles, can be viewed as defining the components of the electric and magnetic fields. Once the field components are known from the accelerations of a few test particles, they can be used to predict the accelerations of other test particles (Box 3.1). Thus the Lorentz force law does double service (1) as definer of fields and (2) as predicter of motions.
Here and elsewhere in science, as stressed not least by Henri Poincaré, that view is out of date which used to say, "Define your terms before you proceed." All the laws and theories of physics, including the Lorentz force law, have this deep and subtle character, that they both define the concepts they use (here B and E E E\boldsymbol{E}E ) and make statements about these concepts. Contrariwise, the absence of some body of theory, law, and principle deprives one of the means properly to define or even to use concepts. Any forward step in human knowledge is truly creative in this sense: that theory, concept, law, and method of measurement-forever inseparable-are born into the world in union.
Lorentz force as definer of fields and predicter of motions

Box 3.1 LORENTZ FORCE LAW AS BOTH DEFINER OF FIELDS AND PREDICTER OF MOTIONS

How one goes about determining the components of the field from measurements of accelerations is not different in principle for electromagnetism and for gravitation. Compare the equations in the two cases:
(1) d 2 x α d τ 2 = e m F α β u β in a Lorentz frame, (1) d 2 x α d τ 2 = e m F α β u β  in a Lorentz frame,  {:(1)(d^(2)x^(alpha))/(dtau^(2))=(e)/(m)F^(alpha)_(beta)u^(beta)" in a Lorentz frame, ":}\begin{equation*} \frac{d^{2} x^{\alpha}}{d \tau^{2}}=\frac{e}{m} F^{\alpha}{ }_{\beta} u^{\beta} \text { in a Lorentz frame, } \tag{1} \end{equation*}(1)d2xαdτ2=emFαβuβ in a Lorentz frame, 
and
D 2 ξ α d τ 2 = R α β γ δ u β ξ γ u δ D 2 ξ α d τ 2 = R α β γ δ u β ξ γ u δ (D^(2)xi^(alpha))/(dtau^(2))=-R^(alpha)_(beta gamma delta)u^(beta)xi^(gamma)u^(delta)\frac{D^{2} \xi^{\alpha}}{d \tau^{2}}=-R^{\alpha}{ }_{\beta \gamma \delta} u^{\beta} \xi^{\gamma} u^{\delta}D2ξαdτ2=Rαβγδuβξγuδ in any coordinate system.
To make explicit the simpler procedure for electromagnetism will indicate in broad outline how one similarly determines all the components of R α β γ δ R α β γ δ R^(alpha)_(beta gamma delta)R^{\alpha}{ }_{\beta \gamma \delta}Rαβγδ for gravity. Begin by asking how many test particles one needs to determine the three components of B B B\boldsymbol{B}B and the three components of E E E\boldsymbol{E}E in the neighborhood under study. For one particle, three components of acceleration are measurable; for a second particle, three more. Enough? No! The information from the one duplicates in part the information from the other. The proof? Whatever the state of motion of the first test particle, pick one's Lorentz frame to be moving the same way. Having zero velocity in this frame, the particle has a zero response to any magnetic field. The electric field alone acts on the particle. The three components of its acceleration give directly the three components E x , E y , E z E x , E y , E z E_(x),E_(y),E_(z)E_{x}, E_{y}, E_{z}Ex,Ey,Ez of the electric field. The second test particle cannot be at rest if it is to do more than duplicate the information provided by the first test particle. Orient the x x xxx-axis of the frame
of reference parallel to the direction of motion of this second particle, which will then respond to and measure the components B y B y B_(y)B_{y}By and B z B z B_(z)B_{z}Bz of the magnetic field. Not so B x B x B_(x)B_{x}Bx ! The acceleration in the x x xxx-direction merely remeasures the already once measured E x E x E_(x)E_{x}Ex. To evaluate B x B x B_(x)B_{x}Bx, a third test particle is required, but it then gives duplicate information about the other field components. The alternative? Use all N N NNN particles simultaneously and on the same democratic footing, both in the evaluation of the six F α β F α β F_(alpha beta)F_{\alpha \beta}Fαβ and in the testing of the Lorentz force, by applying the method of least squares. Thus, write the discrepancy between predicted and observed acceleration of the K K KKK th particle in the form
(3) u ˙ α K e m F α β u β , K = δ a α K (3) u ˙ α K e m F α β u β , K = δ a α K {:(3)u^(˙)_(alpha)^(K)-(e)/(m)F_(alpha beta)u^(beta,K)=deltaa_(alpha)^(K):}\begin{equation*} \dot{u}_{\alpha}^{K}-\frac{e}{m} F_{\alpha \beta} u^{\beta, K}=\delta a_{\alpha}^{K} \tag{3} \end{equation*}(3)u˙αKemFαβuβ,K=δaαK
Take the squared magnitude of this discrepancy and sum over all the particles
(4) S = k η α β δ a α K δ a K β . (4) S = k η α β δ a α K δ a K β . {:(4)S=sum_(k)eta^(alpha beta)deltaa_(alpha)^(K)deltaa^(K)_(beta).:}\begin{equation*} S=\sum_{k} \eta^{\alpha \beta} \delta a_{\alpha}{ }^{K} \delta a^{K}{ }_{\beta} . \tag{4} \end{equation*}(4)S=kηαβδaαKδaKβ.
In this expression, everything is regarded as known except the six F α β F α β F_(alpha beta)F_{\alpha \beta}Fαβ. Minimize with respect to these six unknowns. In this way, arrive at six equations for the components of B B B\boldsymbol{B}B and E E E\boldsymbol{E}E. These equations once solved, one goes back to (3) to test the Lorentz force law.
The 6 × 6 6 × 6 6xx66 \times 66×6 determinant of the coefficients in the equation for the F α β F α β F_(alpha beta)F_{\alpha \beta}Fαβ automatically vanishes when there are only two test particles. The same line of reasoning permits one to determine the minimum number of test particles required to determine all the components of the Riemann curvature tensor.
The Lorentz force law, written in familiar three-dimensional notation, with E = E = E=\boldsymbol{E}=E= electric field, B = B = B=\boldsymbol{B}=B= magnetic field, v = v = v=\boldsymbol{v}=v= ordinary velocity of particle, p = p = p=\boldsymbol{p}=p= momentum of particle, e = e = e=e=e= charge of particle, reads
(3.1) ( d p / d t ) = e ( E + v × B ) (3.1) ( d p / d t ) = e ( E + v × B ) {:(3.1)(dp//dt)=e(E+v xx B):}\begin{equation*} (d \boldsymbol{p} / d t)=e(\boldsymbol{E}+\boldsymbol{v} \times \boldsymbol{B}) \tag{3.1} \end{equation*}(3.1)(dp/dt)=e(E+v×B)
Useful though this version of the equation may be, it is far from the geometric spirit of Einstein. A fully geometric equation will involve the test particle's energy-momentum 4 -vector, p p p\boldsymbol{p}p, not just the spatial part p p p\boldsymbol{p}p as measured in a specific Lorentz frame; and it will ask for the rate of change of momentum not as measured by a specific Lorentz observer ( d / d t ) ( d / d t ) (d//dt)(d / d t)(d/dt), but as measured by the only clock present à priori in the problem: the test particle's own clock ( d / d τ ) ( d / d τ ) (d//d tau)(d / d \tau)(d/dτ). Thus, the lefthand side of a fully geometric equation will read
d p / d τ = d p / d τ = dp//d tau=d \mathbf{p} / d \tau=dp/dτ=
The righthand side, the Lorentz 4 -force, must also be a frame-independent object. It will be linear in the particle's 4 -velocity u u u\boldsymbol{u}u, since the frame-dependent expression
(3.2a) d p d τ = 1 1 v 2 d p d t = e 1 v 2 ( E + v × B ) = e ( u 0 E + u × B ) (3.2a) d p d τ = 1 1 v 2 d p d t = e 1 v 2 ( E + v × B ) = e u 0 E + u × B {:(3.2a)(dp)/(d tau)=(1)/(sqrt(1-v^(2)))(dp)/(dt)=(e)/(sqrt(1-v^(2)))(E+v xx B)=e(u^(0)E+u xx B):}\begin{equation*} \frac{d \boldsymbol{p}}{d \tau}=\frac{1}{\sqrt{1-v^{2}}} \frac{d \boldsymbol{p}}{d t}=\frac{e}{\sqrt{1-\boldsymbol{v}^{2}}}(\boldsymbol{E}+\boldsymbol{v} \times \boldsymbol{B})=e\left(u^{0} \boldsymbol{E}+\boldsymbol{u} \times \boldsymbol{B}\right) \tag{3.2a} \end{equation*}(3.2a)dpdτ=11v2dpdt=e1v2(E+v×B)=e(u0E+u×B)
is linear in the components of u u u\boldsymbol{u}u. Consequently, there must be a linear machine named Faraday, or F F F\boldsymbol{F}F, or "electromagnetic field tensor," with a slot into which one inserts the 4 -velocity of a test particle. The output of this machine, multiplied by the particle's charge, must be the electromagnetic 4 -force that it feels:
(3.3) d p / d τ = e F ( u ) (3.3) d p / d τ = e F ( u ) {:(3.3)dp//d tau=eF(u):}\begin{equation*} d \boldsymbol{p} / d \tau=e \boldsymbol{F}(\boldsymbol{u}) \tag{3.3} \end{equation*}(3.3)dp/dτ=eF(u)
By comparing this geometric equation with the original Lorentz force law (equation 3.2 a ), and with the companion energy-change law
(3.2b) d p 0 d τ = 1 1 v 2 d E d t = 1 1 v 2 e E v = e E u (3.2b) d p 0 d τ = 1 1 v 2 d E d t = 1 1 v 2 e E v = e E u {:(3.2b)(dp^(0))/(d tau)=(1)/(sqrt(1-v^(2)))(dE)/(dt)=(1)/(sqrt(1-v^(2)))eE*v=eE*u:}\begin{equation*} \frac{d p^{0}}{d \tau}=\frac{1}{\sqrt{1-\boldsymbol{v}^{2}}} \frac{d E}{d t}=\frac{1}{\sqrt{1-v^{2}}} e \boldsymbol{E} \cdot \boldsymbol{v}=e \boldsymbol{E} \cdot \boldsymbol{u} \tag{3.2b} \end{equation*}(3.2b)dp0dτ=11v2dEdt=11v2eEv=eEu
one can read off the components of F F F\boldsymbol{F}F in a specific Lorentz frame. The components of d p / d τ d p / d τ dp//d taud \boldsymbol{p} / d \taudp/dτ are d p α / d τ d p α / d τ dp^(alpha)//d taud p^{\alpha} / d \taudpα/dτ, and the components of e F ( u ) e F ( u ) eF(u)e \boldsymbol{F}(\boldsymbol{u})eF(u) can be written (definition of F α β ! F α β ! F^(alpha)_(beta)!F^{\alpha}{ }_{\beta}!Fαβ! ) e F α β u β e F α β u β eF^(alpha)_(beta)u^(beta)e F^{\alpha}{ }_{\beta} u^{\beta}eFαβuβ. Consequently
(3.4) d p α / d τ = e F α β u β (3.4) d p α / d τ = e F α β u β {:(3.4)dp^(alpha)//d tau=eF^(alpha)_(beta)u^(beta):}\begin{equation*} d p^{\alpha} / d \tau=e F^{\alpha}{ }_{\beta} u^{\beta} \tag{3.4} \end{equation*}(3.4)dpα/dτ=eFαβuβ
must reduce to equations ( 3.2 a , b ) ( 3.2 a , b ) (3.2a,b)(3.2 \mathrm{a}, \mathrm{b})(3.2a,b). Indeed it does if one makes the identification
(3.5) β = 0 α = 1 α = 0 F α β = α = 2 α = 1 α α α 0 E x E y E z E x 0 B z B y E y B z 0 B x E z B y B x 0 . (3.5) β = 0 α = 1 α = 0 F α β = α = 2 α = 1 α α α 0 E x E y E z E x 0 B z B y E y B z 0 B x E z B y B x 0 . {:(3.5){:[beta=0],[alpha=1],[alpha=0],[||F^(alpha)_(beta)||={:[alpha=2],[alpha=1],[alpha],[alpha],[alpha]:}||^(0)]:}||[E_(x),E_(y),E_(z)],[E_(x),0,B_(z),-B_(y)],[E_(y),-B_(z),0,B_(x)],[E_(z),B_(y),-B_(x),0]||.:}\begin{array}{r} \beta=0 \\ \alpha=1 \tag{3.5}\\ \alpha=0 \\ \left\|F^{\alpha}{ }_{\beta}\right\|=\begin{array}{l} \alpha=2 \\ \alpha=1 \\ \alpha \\ \alpha \\ \alpha \end{array} \|^{0} \end{array}\left\|\begin{array}{cccc} E_{x} & E_{y} & E_{z} \\ E_{x} & 0 & B_{z} & -B_{y} \\ E_{y} & -B_{z} & 0 & B_{x} \\ E_{z} & B_{y} & -B_{x} & 0 \end{array}\right\| .(3.5)β=0α=1α=0Fαβ=α=2α=1ααα0ExEyEzEx0BzByEyBz0BxEzByBx0.
The three-dimensional version of the Lorentz force law
Electromagnetic field tensor defined
Geometrical version of Lorentz force law
Components of electromagnetic field tensor
More often seen in the literature are the "covariant components," obtained by lowering an index with the metric components:
(3.6) F α β = η α γ F β γ (3.7) F α β = 0 E x E y E z E r 0 B z B y E y B z 0 B x E z B y B x 0 . (3.6) F α β = η α γ F β γ (3.7) F α β = 0 E x E y E z E r 0 B z B y E y B z 0 B x E z B y B x 0 . {:[(3.6)F_(alpha beta)=eta_(alpha gamma)F_(beta)^(gamma)],[(3.7)||F_(alpha beta)||=||[0,-E_(x),-E_(y),-E_(z)],[E_(r),0,B_(z),-B_(y)],[E_(y),-B_(z),0,B_(x)],[E_(z),B_(y),-B_(x),0]||.]:}\begin{gather*} F_{\alpha \beta}=\eta_{\alpha \gamma} F_{\beta}^{\gamma} \tag{3.6}\\ \left\|F_{\alpha \beta}\right\|=\left\|\begin{array}{cccc} 0 & -E_{x} & -E_{y} & -E_{z} \\ E_{r} & 0 & B_{z} & -B_{y} \\ E_{y} & -B_{z} & 0 & B_{x} \\ E_{z} & B_{y} & -B_{x} & 0 \end{array}\right\| . \tag{3.7} \end{gather*}(3.6)Fαβ=ηαγFβγ(3.7)Fαβ=0ExEyEzEr0BzByEyBz0BxEzByBx0.
This matrix equation demonstrates the unity of the electric and magnetic fields. Neither one by itself, E E E\boldsymbol{E}E or B B B\boldsymbol{B}B, is a frame-independent, geometric entity. But merged together into a single entity, F = F a r a d a y F = F a r a d a y F=Faraday\boldsymbol{F}=\boldsymbol{F a r a d a y}F=Faraday, they acquire a meaning and significance that transcends coordinates and reference frames.

EXERCISE

Exercise 3.1.

Derive equations (3.5) and (3.7) for the components of Faraday by comparing (3.4) with (3.2a,b), and by using definition (3.6).

§3.2. TENSORS IN ALL GENERALITY

Examples of tensors
A digression is in order. Now on the scene are several different tensors: the metric tensor g g g\boldsymbol{g}g (§2.4), the Riemann curvature tensor Riemann (§1.6), the electromagnetic field tensor Faraday ($3.1). Each has been defined as a linear machine with input slots for vectors, and with an output that is either a real number, e.g., g ( u , v ) g ( u , v ) g(u,v)\boldsymbol{g}(\boldsymbol{u}, \boldsymbol{v})g(u,v), or a vector, e.g., Riemann (u,v,w) and Faraday (u).
Should one make a distinction between tensors whose outputs are scalars, and tensors whose outputs are vectors? No! A tensor whose output is a vector can be reinterpreted trivially as one whose output is a scalar. Take, for example, Fara d a y = F d a y = F day=F\boldsymbol{d a y}=\boldsymbol{F}day=F. Add a new slot for the insertion of an arbitrary 1 -form σ σ sigma\boldsymbol{\sigma}σ, and gears and wheels that guarantee the output
(3.8) F ( σ , u ) = σ , F ( u ) = real number. (3.8) F ( σ , u ) = σ , F ( u ) =  real number.  {:(3.8)F(sigma","u)=(:sigma","F(u):)=" real number. ":}\begin{equation*} \boldsymbol{F}(\boldsymbol{\sigma}, \boldsymbol{u})=\langle\boldsymbol{\sigma}, \boldsymbol{F}(\boldsymbol{u})\rangle=\text { real number. } \tag{3.8} \end{equation*}(3.8)F(σ,u)=σ,F(u)= real number. 
Then permit the user to choose whether he inserts only a vector, and gets out the vector F ( , u ) = F ( u ) F ( , u ) = F ( u ) F(dots,u)=F(u)\boldsymbol{F}(\ldots, \boldsymbol{u})=\boldsymbol{F}(\boldsymbol{u})F(,u)=F(u), or whether he inserts a form and a vector, and gets out the number F ( σ , u ) F ( σ , u ) F(sigma,u)\boldsymbol{F}(\boldsymbol{\sigma}, \boldsymbol{u})F(σ,u). The same machine will do both jobs. Moreover, in terms of components in a given Lorentz frame, both jobs are achieved very simply:
F ( , u ) F ( , u ) F(dots,u)\boldsymbol{F}(\ldots, \boldsymbol{u})F(,u) is a vector with components F α β u β F α β u β F^(alpha)_(beta)u^(beta)F^{\alpha}{ }_{\beta} u^{\beta}Fαβuβ;
F ( σ , u ) F ( σ , u ) F(sigma,u)\boldsymbol{F}(\boldsymbol{\sigma}, \boldsymbol{u})F(σ,u) is the number σ , F ( , u ) = σ α F α β u β σ , F ( , u ) = σ α F α β u β (:sigma,F(dots,u):)=sigma_(alpha)F^(alpha)_(beta)u^(beta)\langle\boldsymbol{\sigma}, \boldsymbol{F}(\ldots, \boldsymbol{u})\rangle=\sigma_{\alpha} F^{\alpha}{ }_{\beta} u^{\beta}σ,F(,u)=σαFαβuβ.
By analogy, one defines the most general tensor H H H\boldsymbol{H}H and its rank ( n m ) ( n m ) ((n)/(m))\binom{n}{m}(nm) as follows: H H H\boldsymbol{H}H is a linear machine with n n nnn input slots for n 1 n 1 n1n 1n1-forms, and m m mmm input slots for m m mmm vectors; given the requested input, it puts out a real number denoted
(3.10) H ( σ , λ , , β n 1-forms , , u , v , , w ) m vectors (3.10) H ( σ , λ , , β n  1-forms  , , u , v , , w ) m  vectors  {:(3.10)H(ubrace(sigma,lambda,dots,betaubrace)_(n" 1-forms ")","ubrace(,u,v,dots,w)ubrace)_(m" vectors "):}\begin{equation*} \boldsymbol{H}(\underbrace{\boldsymbol{\sigma}, \boldsymbol{\lambda}, \ldots, \boldsymbol{\beta}}_{n \text { 1-forms }}, \underbrace{, \boldsymbol{u}, \boldsymbol{v}, \ldots, \boldsymbol{w})}_{m \text { vectors }} \tag{3.10} \end{equation*}(3.10)H(σ,λ,,βn 1-forms ,,u,v,,w)m vectors 
For most tensors the output changes when two input vectors are interchanged,
(3.11) Riemann ( σ , u , v , w ) Riemann ( σ , v , u , w ) (3.11) Riemann ( σ , u , v , w ) Riemann ( σ , v , u , w ) {:(3.11)Riemann(sigma","u","v","w)!=Riemann(sigma","v","u","w):}\begin{equation*} \operatorname{Riemann}(\sigma, u, v, w) \neq \operatorname{Riemann}(\sigma, v, u, w) \tag{3.11} \end{equation*}(3.11)Riemann(σ,u,v,w)Riemann(σ,v,u,w)
or when two input 1 -forms are interchanged.
Choose a specific tensor S S S\boldsymbol{S}S, of rank ( 2 1 ) ( 2 1 ) ((2)/(1))\binom{2}{1}(21) for explicitness. Into the slots of S S S\boldsymbol{S}S, insert the basis vectors and 1 -forms of a specific Lorentz coordinate frame. The output is a "component of S S S\boldsymbol{S}S in that frame":
(3.12) S α β γ S ( ω α , ω β , e γ ) . (3.12) S α β γ S ω α , ω β , e γ . {:(3.12)S^(alpha beta)_(gamma)-=S(omega^(alpha),omega^(beta),e_(gamma)).:}\begin{equation*} S^{\alpha \beta}{ }_{\gamma} \equiv \boldsymbol{S}\left(\boldsymbol{\omega}^{\alpha}, \boldsymbol{\omega}^{\beta}, \boldsymbol{e}_{\gamma}\right) . \tag{3.12} \end{equation*}(3.12)SαβγS(ωα,ωβ,eγ).
This defines components. Knowing the components in a specific frame, one can easily calculate the output produced from any input forms and vectors:
(3.13) S ( σ , ρ , v ) = S ( σ α ω α , ρ β ω β , v γ e γ ) = σ α ρ β v γ S ( ω α , ω β , e γ ) = S α β γ σ α ρ β v γ . (3.13) S ( σ , ρ , v ) = S σ α ω α , ρ β ω β , v γ e γ = σ α ρ β v γ S ω α , ω β , e γ = S α β γ σ α ρ β v γ . {:[(3.13)S(sigma","rho","v)=S(sigma_(alpha)omega^(alpha),rho_(beta)omega^(beta),v^(gamma)e_(gamma))=sigma_(alpha)rho_(beta)v^(gamma)S(omega^(alpha),omega^(beta),e_(gamma))],[=S^(alpha beta)_(gamma)sigma_(alpha)rho_(beta)v^(gamma).]:}\begin{align*} \boldsymbol{S}(\boldsymbol{\sigma}, \boldsymbol{\rho}, \boldsymbol{v}) & =\boldsymbol{S}\left(\sigma_{\alpha} \boldsymbol{\omega}^{\alpha}, \rho_{\beta} \boldsymbol{\omega}^{\beta}, v^{\gamma} \boldsymbol{e}_{\gamma}\right)=\sigma_{\alpha} \rho_{\beta} v^{\gamma} \boldsymbol{S}\left(\boldsymbol{\omega}^{\alpha}, \boldsymbol{\omega}^{\beta}, \boldsymbol{e}_{\gamma}\right) \tag{3.13}\\ & =S^{\alpha \beta}{ }_{\gamma} \sigma_{\alpha} \rho_{\beta} v^{\gamma} . \end{align*}(3.13)S(σ,ρ,v)=S(σαωα,ρβωβ,vγeγ)=σαρβvγS(ωα,ωβ,eγ)=Sαβγσαρβvγ.
And knowing the components of S S S\boldsymbol{S}S in one Lorentz frame (unprimed), plus the Lorentz transformation matrices Λ α β Λ α β ||Lambda^(alpha^('))_(beta)||\left\|\Lambda^{\alpha^{\prime}}{ }_{\beta}\right\|Λαβ and Λ β α Λ β α ||Lambda^(beta)_(alpha^('))||\left\|\Lambda^{\beta}{ }_{\alpha^{\prime}}\right\|Λβα which link that frame with another (primed), one can calculate the components in the new (primed) frame. As shown in exercise 3.2 , one need only apply a matrix to each index of S S S\boldsymbol{S}S, lining up the matrix indices in the logical manner
(3.14) S μ ν λ = S α β γ Λ μ α Λ v β Λ γ λ (3.14) S μ ν λ = S α β γ Λ μ α Λ v β Λ γ λ {:(3.14)S^(mu^(')nu^('))_(lambda^('))=S^(alpha beta)_(gamma)Lambda^(mu^('))_(alpha)Lambda^(v^('))_(beta)Lambda^(gamma)_(lambda^(')):}\begin{equation*} S^{\mu^{\prime} \nu^{\prime}}{ }_{\lambda^{\prime}}=S^{\alpha \beta}{ }_{\gamma} \Lambda^{\mu^{\prime}}{ }_{\alpha} \Lambda^{v^{\prime}}{ }_{\beta} \Lambda^{\gamma}{ }_{\lambda^{\prime}} \tag{3.14} \end{equation*}(3.14)Sμνλ=SαβγΛμαΛvβΛγλ
A slight change of the internal gears and wheels inside the tensor enables one of its 1 -form slots to accept a vector. All that is necessary is a mechanism to convert an input vector n n n\boldsymbol{n}n into its corresponding 1 -form n ~ n ~ widetilde(n)\widetilde{\boldsymbol{n}}n~ and then to put that 1 -form into the old machinery. Thus, denoting the modified tensor by the same symbol S S S\boldsymbol{S}S as was used for the original tensor, one demands
(3.15) S ( σ , n , v ) = S ( σ , n ~ , v ) (3.15) S ( σ , n , v ) = S ( σ , n ~ , v ) {:(3.15)S(sigma","n","v)=S(sigma"," tilde(n)","v):}\begin{equation*} S(\sigma, n, v)=S(\sigma, \tilde{n}, v) \tag{3.15} \end{equation*}(3.15)S(σ,n,v)=S(σ,n~,v)
or, in component notation
( ) S α β γ σ α n β v γ = S α β γ σ α n β v γ . ( ) S α β γ σ α n β v γ = S α β γ σ α n β v γ . {:('")"S^(alpha)_(beta gamma)sigma_(alpha)n^(beta)v^(gamma)=S^(alpha beta)_(gamma)sigma_(alpha)n_(beta)v^(gamma).:}\begin{equation*} S^{\alpha}{ }_{\beta \gamma} \sigma_{\alpha} n^{\beta} v^{\gamma}=S^{\alpha \beta}{ }_{\gamma} \sigma_{\alpha} n_{\beta} v^{\gamma} . \tag{$\prime$} \end{equation*}()Sαβγσαnβvγ=Sαβγσαnβvγ.
This is achieved if one raises and lowers the indices of S S S\boldsymbol{S}S using the components of the metric:
(3.16) S α β γ = η β μ S α μ γ , S α μ γ = η μ β S α β γ . (3.16) S α β γ = η β μ S α μ γ , S α μ γ = η μ β S α β γ . {:(3.16)S^(alpha)_(beta gamma)=eta_(beta mu)S^(alpha mu)_(gamma)","quadS^(alpha mu)_(gamma)=eta^(mu beta)S^(alpha)_(beta gamma).:}\begin{equation*} S^{\alpha}{ }_{\beta \gamma}=\eta_{\beta \mu} S^{\alpha \mu}{ }_{\gamma}, \quad S^{\alpha \mu}{ }_{\gamma}=\eta^{\mu \beta} S^{\alpha}{ }_{\beta \gamma} . \tag{3.16} \end{equation*}(3.16)Sαβγ=ηβμSαμγ,Sαμγ=ημβSαβγ.
(See exercise 3.3 below.) By using the same symbol S S S\boldsymbol{S}S for the original tensor and
Definition of tensor as linear machine that converts vectors and 1 -forms into numbers
Components of a tensor
Tensor's machine action expressed in terms of components
Lorentz transformation of components
Modifying a tensor to accept either a vector or a 1 -form into each slot
Raising and lowering indices
the modified tensor, one allows each slot to accept either a 1 -form or a vector, so one loses sight of whether S S S\boldsymbol{S}S is a ( 2 1 ) ( 2 1 ) ((2)/(1))\binom{2}{1}(21) tensor, or a ( 1 2 ) ( 1 2 ) ((1)/(2))\binom{1}{2}(12) tensor, or a ( 3 0 ) ( 3 0 ) ((3)/(0))\binom{3}{0}(30) tensor, or a ( 0 3 ) ( 0 3 ) ((0)/(3))\binom{0}{3}(03) tensor; one only distinguishes its total rank, 3. Terminology: an "upstairs index" is called "contravariant"; a "downstairs" index is called "covariant." Thus in S α β γ S α β γ S^(alpha)_(beta gamma)S^{\alpha}{ }_{\beta \gamma}Sαβγ, " α α alpha\alphaα " is a contravariant index, while " β β beta\betaβ " and " γ γ gamma\gammaγ " are covariant indices.
Because tensors are nothing but functions, they can be added (if they have the same rank!) and multiplied by numbers in the usual way: the output of the rank-3 tensor a S + b Q a S + b Q aS+bQa \boldsymbol{S}+b \boldsymbol{Q}aS+bQ, when vectors u , v , w u , v , w u,v,w\boldsymbol{u}, \boldsymbol{v}, \boldsymbol{w}u,v,w are put in, is
(3.17) ( a S + b Q ) ( u , v , w ) a S ( u , v , w ) + b Q ( u , v , w ) (3.17) ( a S + b Q ) ( u , v , w ) a S ( u , v , w ) + b Q ( u , v , w ) {:(3.17)(aS+bQ)(u","v","w)-=aS(u","v","w)+bQ(u","v","w):}\begin{equation*} (a \boldsymbol{S}+b \boldsymbol{Q})(\boldsymbol{u}, \boldsymbol{v}, \boldsymbol{w}) \equiv a \boldsymbol{S}(\boldsymbol{u}, \boldsymbol{v}, \boldsymbol{w})+b \mathbf{Q}(\boldsymbol{u}, \boldsymbol{v}, \boldsymbol{w}) \tag{3.17} \end{equation*}(3.17)(aS+bQ)(u,v,w)aS(u,v,w)+bQ(u,v,w)
Several other important operations on tensors are explored in the following exercises. They and the results of the exercises will be used freely in the material that follows.

EXERCISES

Exercise 3.2. TRANSFORMATION LAW FOR COMPONENTS OF A TENSOR

From the transformation laws for components of vectors and 1 -forms, derive the transformation law (3.14).

Exercise 3.3. RAISING AND LOWERING INDICES

Derive equations (3.16) from equation (3.15') plus the law n α = η α β n β n α = η α β n β n_(alpha)=eta_(alpha beta)n^(beta)n_{\alpha}=\eta_{\alpha \beta} n^{\beta}nα=ηαβnβ for getting the components of the 1 -form n ~ n ~ tilde(n)\tilde{\boldsymbol{n}}n~ from the components of its corresponding vector n n n\boldsymbol{n}n.

Exercise 3.4. TENSOR PRODUCT

Given any two vectors u u u\boldsymbol{u}u and v v v\boldsymbol{v}v, one defines the second-rank tensor u v u v u ox v\boldsymbol{u} \otimes \boldsymbol{v}uv ("tensor product of u u u\boldsymbol{u}u with v v v\boldsymbol{v}v ") to be a machine, with two input slots, whose output is the number
(3.18) ( u v ) ( σ , λ ) = σ , u λ , v (3.18) ( u v ) ( σ , λ ) = σ , u λ , v {:(3.18)(u ox v)(sigma","lambda)=(:sigma","u:)(:lambda","v:):}\begin{equation*} (u \otimes v)(\sigma, \boldsymbol{\lambda})=\langle\sigma, u\rangle\langle\boldsymbol{\lambda}, \boldsymbol{v}\rangle \tag{3.18} \end{equation*}(3.18)(uv)(σ,λ)=σ,uλ,v
when 1-forms σ σ sigma\boldsymbol{\sigma}σ and λ λ lambda\boldsymbol{\lambda}λ are inserted. Show that the components T T = u v T T = u v TT=u ox v\boldsymbol{T} \boldsymbol{T}=\boldsymbol{u} \otimes \boldsymbol{v}TT=uv are the products of the components of u u u\boldsymbol{u}u and v v v\boldsymbol{v}v :
(3.19) T α β = u α v β , T α β = u α v β , T α β = u α v β . (3.19) T α β = u α v β , T α β = u α v β , T α β = u α v β . {:(3.19)T^(alpha beta)=u^(alpha)v^(beta)","quadT_(alpha)^(beta)=u_(alpha)v^(beta)","quadT_(alpha beta)=u_(alpha)v_(beta).:}\begin{equation*} T^{\alpha \beta}=u^{\alpha} v^{\beta}, \quad T_{\alpha}{ }^{\beta}=u_{\alpha} v^{\beta}, \quad T_{\alpha \beta}=u_{\alpha} v_{\beta} . \tag{3.19} \end{equation*}(3.19)Tαβ=uαvβ,Tαβ=uαvβ,Tαβ=uαvβ.
Extend the definition to several vectors and forms,
(3.20) ( u v β w ) ( σ , λ , n , ζ ) = σ , u λ , v β , n ζ , w , (3.20) ( u v β w ) ( σ , λ , n , ζ ) = σ , u λ , v β , n ζ , w , {:(3.20)(u ox v ox beta ox w)(sigma","lambda","n","zeta)=(:sigma","u:)(:lambda","v:)(:beta","n:)(:zeta","w:)",":}\begin{equation*} (u \otimes v \otimes \beta \otimes w)(\sigma, \lambda, n, \zeta)=\langle\sigma, u\rangle\langle\lambda, v\rangle\langle\beta, n\rangle\langle\zeta, w\rangle, \tag{3.20} \end{equation*}(3.20)(uvβw)(σ,λ,n,ζ)=σ,uλ,vβ,nζ,w,
and show that the product rule for components still holds:
(3.21) S = u v β w has components S λ μ μ ν = u μ v ν β λ w ζ . (3.21) S = u v β w  has components  S λ μ μ ν = u μ v ν β λ w ζ . {:[(3.21)S=u ox v ox beta ox w" has components "],[S_(lambda^(mu))^(mu nu)=u^(mu)v^(nu)beta_(lambda)w^(zeta).]:}\begin{gather*} \boldsymbol{S}=\boldsymbol{u} \otimes \boldsymbol{v} \otimes \boldsymbol{\beta} \otimes \boldsymbol{w} \text { has components } \tag{3.21}\\ S_{\lambda^{\mu}}^{\mu \nu}=u^{\mu} v^{\nu} \beta_{\lambda} w^{\zeta} . \end{gather*}(3.21)S=uvβw has components Sλμμν=uμvνβλwζ.

Exercise 3.5. BASIS TENSORS

Show that a tensor M M M\boldsymbol{M}M with components M α β γ δ M α β γ δ M^(alpha beta)gamma^(delta)M^{\alpha \beta} \gamma^{\delta}Mαβγδ in a given Lorentz frame can be reconstructed from its components and from the basis 1 -forms and vectors of that frame as follows:
(3.22) M = M α β γ δ e α e β ω γ e δ (3.22) M = M α β γ δ e α e β ω γ e δ {:(3.22)M=M^(alpha beta)gamma^(delta)e_(alpha)oxe_(beta)oxomega^(gamma)oxe_(delta):}\begin{equation*} \boldsymbol{M}=M^{\alpha \beta} \gamma^{\delta} \boldsymbol{e}_{\alpha} \otimes \boldsymbol{e}_{\beta} \otimes \boldsymbol{\omega}^{\gamma} \otimes \boldsymbol{e}_{\delta} \tag{3.22} \end{equation*}(3.22)M=Mαβγδeαeβωγeδ
(For a special case of this, see Box 3.2.)

Box 3.2 THE METRIC IN DIFFERENT LANGUAGES

A. Geometric Language

g g g\boldsymbol{g}g is a linear, symmetric machine with two slots for insertion of vectors. When vectors u u u\boldsymbol{u}u and v v v\boldsymbol{v}v are inserted, the output of g g g\boldsymbol{g}g is their scalar product:
g ( u , v ) = u v g ( u , v ) = u v g(u,v)=u*v\boldsymbol{g}(\boldsymbol{u}, \boldsymbol{v})=\boldsymbol{u} \cdot \boldsymbol{v}g(u,v)=uv

B. Component Language

η μ ν η μ ν eta_(mu nu)\eta_{\mu \nu}ημν are the metric components. They are used to calculate the scalar product of two vectors from components in a specific Lorentz frame:
u v = η μ ν u μ v ν u v = η μ ν u μ v ν u*v=eta_(mu nu)u^(mu)v^(nu)\boldsymbol{u} \cdot \boldsymbol{v}=\eta_{\mu \nu} u^{\mu} v^{\nu}uv=ημνuμvν

C. Coordinate-Based Geometric Language

The metric g g g\boldsymbol{g}g can be written, in terms of basis 1-forms of a specific Lorentz frame, as
g = η μ ν ω μ ω ν = η μ ν d x μ d x ν g = η μ ν ω μ ω ν = η μ ν d x μ d x ν g=eta_(mu nu)omega^(mu)oxomega^(nu)=eta_(mu nu)dx^(mu)ox dx^(nu)\boldsymbol{g}=\eta_{\mu \nu} \boldsymbol{\omega}^{\mu} \otimes \boldsymbol{\omega}^{\nu}=\eta_{\mu \nu} \boldsymbol{d} x^{\mu} \otimes \boldsymbol{d} x^{\nu}g=ημνωμων=ημνdxμdxν
[see equations (2.18) and (3.22)].

D. Connection to the Elementary Concept of Line Element

Box 2.3 demonstrated the correspondence between the gradient d f d f df\boldsymbol{d} fdf of a function, and the elementary concept d f d f dfd fdf of a differential change of f f fff in some unspecified direction. There is a similar correspondence between the metric, written as η μ ν d x μ η μ ν d x μ eta_(mu nu)dx^(mu)\eta_{\mu \nu} \boldsymbol{d} x^{\mu}ημνdxμ d x ν d x ν ox dx^(nu)\otimes \boldsymbol{d} x^{\nu}dxν, and the elementary concept of "line element," written as d s 2 = η μ ν d x μ d x ν d s 2 = η μ ν d x μ d x ν ds^(2)=eta_(mu nu)dx^(mu)dx^(nu)d s^{2}=\eta_{\mu \nu} d x^{\mu} d x^{\nu}ds2=ημνdxμdxν. This elementary line element, as expounded in many special relativity texts, represents the squared length of the displacement " d x d x dxd xdx " " in an unspecified direction. The metric η μ ν d x μ d x ν η μ ν d x μ d x ν eta_(mu nu)dx^(mu)ox dx^(nu)\eta_{\mu \nu} \boldsymbol{d} x^{\mu} \otimes \boldsymbol{d} x^{\nu}ημνdxμdxν does the same. Pick a specific infinitesimal displacement vector ξ ξ xi\xiξ, and insert it into the slots of η μ ν d x μ d x ν η μ ν d x μ d x ν eta_(mu nu)dx^(mu)ox dx^(nu)\eta_{\mu \nu} \boldsymbol{d} x^{\mu} \otimes \boldsymbol{d} x^{\nu}ημνdxμdxν. The output will be ξ 2 = η μ ν ξ μ ξ ν ξ 2 = η μ ν ξ μ ξ ν xi^(2)=eta_(mu nu)xi^(mu)xi^(nu)\xi^{2}=\eta_{\mu \nu} \xi^{\mu} \xi^{\nu}ξ2=ημνξμξν, the squared length of the displacement. Before ξ ξ xi\xiξ is inserted, η μ ν d x μ d x ν η μ ν d x μ d x ν eta_(mu nu)dx^(mu)ox dx^(nu)\eta_{\mu \nu} \boldsymbol{d} x^{\mu} \otimes \boldsymbol{d} x^{\nu}ημνdxμdxν has the potential to tell the squared length of any vector; the insertion of ξ ξ xi\boldsymbol{\xi}ξ converts potentiality into actuality: the numerical value of ξ 2 ξ 2 xi^(2)\xi^{2}ξ2.
Because the metric η μ ν d x μ d x ν η μ ν d x μ d x ν eta_(mu nu)dx^(mu)ox dx^(nu)\eta_{\mu \nu} \boldsymbol{d} x^{\mu} \otimes \boldsymbol{d} x^{\nu}ημνdxμdxν and the line element d s 2 = η μ ν d x μ d x ν d s 2 = η μ ν d x μ d x ν ds^(2)=eta_(mu nu)dx^(mu)dx^(nu)d s^{2}=\eta_{\mu \nu} d x^{\mu} d x^{\nu}ds2=ημνdxμdxν perform this same function of representing the squared length of an unspecified infinitesimal displacement, there is no conceptual distinction between them. One sometimes uses the symbols d s 2 d s 2 ds^(2)\boldsymbol{d} \boldsymbol{s}^{2}ds2 to denote the metric; one sometimes gets pressed and writes it as d s 2 = η μ ν d x μ d x ν d s 2 = η μ ν d x μ d x ν ds^(2)=eta_(mu nu)dx^(mu)dx^(nu)\boldsymbol{d} \boldsymbol{s}^{2}=\eta_{\mu \nu} \boldsymbol{d} x^{\mu} \boldsymbol{d} x^{\nu}ds2=ημνdxμdxν, omitting the " ox\otimes "; and one sometimes even gets so pressed as to use nonbold characters, so that no notational distinction remains at all between metric and elementary line element:
g = d s 2 = d s 2 = η μ ν d x μ d x ν g = d s 2 = d s 2 = η μ ν d x μ d x ν g=ds^(2)=ds^(2)=eta_(mu nu)dx^(mu)dx^(nu)\boldsymbol{g}=\boldsymbol{d} \boldsymbol{s}^{2}=d s^{2}=\eta_{\mu \nu} d x^{\mu} d x^{\nu}g=ds2=ds2=ημνdxμdxν

Exercise 3.6. Faraday MACHINERY AT WORK

An observer with 4 -velocity u u u\boldsymbol{u}u picks out three directions in spacetime that are orthogonal and purely spatial (no time part) as seen in his frame. Let e 1 ^ , e 2 ^ , e 3 ^ e 1 ^ , e 2 ^ , e 3 ^ e_( hat(1)),e_( hat(2)),e_( hat(3))\boldsymbol{e}_{\hat{1}}, \boldsymbol{e}_{\hat{2}}, \boldsymbol{e}_{\hat{3}}e1^,e2^,e3^ be unit vectors in those directions and let them be oriented in a righthanded way ( e 1 e 2 × e 3 = + 1 e 1 e 2 × e 3 = + 1 (e_(1)*e_(2)xxe_(3)=+1:}\left(\boldsymbol{e}_{1} \cdot \boldsymbol{e}_{2} \times \boldsymbol{e}_{3}=+1\right.(e1e2×e3=+1 in three-dimensional language). Why do the following relations hold?
e j u = 0 , e j e k = δ j k e j u = 0 , e j e k = δ j k e_(j)*u=0,quade_(j)*e_(k)=delta_(jk)\boldsymbol{e}_{j} \cdot \boldsymbol{u}=0, \quad \boldsymbol{e}_{j} \cdot \boldsymbol{e}_{k}=\delta_{j k}eju=0,ejek=δjk
What vectors are to be inserted in the two slots of the electromagnetic field tensor Faraday if one wants to get out the electric field along e j e j e_(j)\boldsymbol{e}_{j}ej as measured by this observer? What vectors must be inserted to get the magnetic field he measures along e j e j e_(j)\boldsymbol{e}_{j}ej ?

§3.3. THREE-PLUS-ONE VIEW VERSUS GEOMETRIC VIEW

The power of the geometric view of physics
Example of electromagnetism
Transformation law for electric and magnetic fields
Great computational and conceptual power resides in Einstein's geometric view of physics. Ideas that seem complex when viewed in the everyday "space-plus-time" or " 3 + 1 3 + 1 3+13+13+1 " manner become elegant and simple when viewed as relations between geometric objects in four-dimensional spacetime. Derivations that are difficult in 3 + 1 3 + 1 3+13+13+1 language simplify in geometric language.
The electromagnetic field is a good example. In geometric language, it is described by a second-rank, antisymmetric tensor ("2-form") F F F\boldsymbol{F}F, which requires no coordinates for its definition. This tensor produces a 4 -force on any charged particle given by
d p / d τ = e F ( u ) . d p / d τ = e F ( u ) . dp//d tau=eF(u).d \boldsymbol{p} / d \tau=e \boldsymbol{F}(\boldsymbol{u}) .dp/dτ=eF(u).
It is all so simple!
By contrast, consider the " 3 + 1 3 + 1 3+13+13+1 " viewpoint. In a given Lorentz frame, there is an electric field E E E\boldsymbol{E}E and a magnetic field B B B\boldsymbol{B}B. They push on a particle in accordance with
d p / d t = e ( E + v × B ) d p / d t = e ( E + v × B ) dp//dt=e(E+v xx B)d \boldsymbol{p} / d t=e(\boldsymbol{E}+\boldsymbol{v} \times \boldsymbol{B})dp/dt=e(E+v×B)
But the values of p , E , v p , E , v p,E,v\boldsymbol{p}, \boldsymbol{E}, \boldsymbol{v}p,E,v, and B B B\boldsymbol{B}B all change when one passes from the given Lorentz frame to a new one. For example, the electric and magnetic fields viewed from a rocket ship ("barred" frame) are related to those viewed in the laboratory ("unbarred" frame) by
(3.23) E = E , E = 1 1 β 2 ( E + β × B ) , B = B , B = 1 1 β 2 ( B β × E ) . (3.23) E ¯ = E , E ¯ = 1 1 β 2 E + β × B , B ¯ = B , B ¯ = 1 1 β 2 B β × E . {:(3.23){:[ bar(E)_(||)=E_(||)",", bar(E)_(_|_)=(1)/(sqrt(1-beta^(2)))(E_(_|_)+beta xxB_(_|_))","],[ bar(B)_(||)=B_(||)",", bar(B)_(_|_)=(1)/(sqrt(1-beta^(2)))(B_(_|_)-beta xxE_(_|_)).]:}:}\begin{array}{ll} \overline{\boldsymbol{E}}_{\|}=\boldsymbol{E}_{\|}, & \overline{\boldsymbol{E}}_{\perp}=\frac{1}{\sqrt{1-\beta^{2}}}\left(\boldsymbol{E}_{\perp}+\boldsymbol{\beta} \times \boldsymbol{B}_{\perp}\right), \\ \overline{\boldsymbol{B}}_{\|}=\boldsymbol{B}_{\|}, & \overline{\boldsymbol{B}}_{\perp}=\frac{1}{\sqrt{1-\beta^{2}}}\left(\boldsymbol{B}_{\perp}-\boldsymbol{\beta} \times \boldsymbol{E}_{\perp}\right) . \tag{3.23} \end{array}(3.23)E=E,E=11β2(E+β×B),B=B,B=11β2(Bβ×E).
(Here "Il" means component along direction of rocket's motion; " _|_\perp " means perpendicular component; and β j = d x j β j = d x j beta^(j)=dx^(j)\beta^{j}=d x^{j}βj=dxj rocket / d t / d t //dt/ d t/dt is the rocket's ordinary velocity.) The analogous transformation laws for the particle's momentum p p p\boldsymbol{p}p and ordinary velocity
v v v\boldsymbol{v}v, and for the coordinate time t t ttt, all conspire-as if by magic, it seems, from the 3 + 1 3 + 1 3+13+13+1 viewpoint-to maintain the validity of the Lorentz force law in all frames.
Not only is the geometric view far simpler than the 3 + 1 3 + 1 3+13+13+1 view, it can even derive the 3 + 1 3 + 1 3+13+13+1 equations with greater ease than can the 3 + 1 3 + 1 3+13+13+1 view itself. Consider, for example, the transformation law (3.23) for the electric and magnetic fields. The geometric view derives it as follows: (1) Orient the axes of the two frames so their relative motion is in the z z zzz-direction. (2) Perform a simple Lorentz transformation (equation 2.45) on the components of the electromagnetic field tensor:
E ¯ = E ¯ z = F 30 = Λ α 3 Λ β 0 F α β = γ 2 F 30 + β 2 γ 2 F 03 (3.24) = ( 1 β 2 ) γ 2 F 30 = F 30 = E z = E , E ¯ x = F 10 = Λ α 1 Λ β 0 F α β = γ F 10 + β γ F 13 = γ ( E x β B y ) , etc. E ¯ = E ¯ z = F 30 ¯ = Λ α 3 Λ β 0 ¯ F α β = γ 2 F 30 + β 2 γ 2 F 03 (3.24) = 1 β 2 γ 2 F 30 = F 30 = E z = E , E ¯ x = F 10 ¯ = Λ α 1 Λ β 0 ¯ F α β = γ F 10 + β γ F 13 = γ E x β B y ,  etc.  {:[ bar(E)_(||)= bar(E)_(z)=F_( bar(30))=Lambda^(alpha)_(3)Lambda^(beta)_( bar(0))F_(alpha beta)=gamma^(2)F_(30)+beta^(2)gamma^(2)F_(03)],[(3.24)=(1-beta^(2))gamma^(2)F_(30)=F_(30)=E_(z)=E_(||)","],[ bar(E)_(x)=F_( bar(10))=Lambda^(alpha)_(1)Lambda^(beta)_( bar(0))F_(alpha beta)=gammaF_(10)+beta gammaF_(13)=gamma(E_(x)-betaB_(y))","],[" etc. "]:}\begin{align*} & \bar{E}_{\|}=\bar{E}_{z}=F_{\overline{30}}=\Lambda^{\alpha}{ }_{3} \Lambda^{\beta}{ }_{\overline{0}} F_{\alpha \beta}=\gamma^{2} F_{30}+\beta^{2} \gamma^{2} F_{03} \\ &=\left(1-\beta^{2}\right) \gamma^{2} F_{30}=F_{30}=E_{z}=E_{\|}, \tag{3.24}\\ & \bar{E}_{x}=F_{\overline{10}}=\Lambda^{\alpha}{ }_{1} \Lambda^{\beta}{ }_{\overline{0}} F_{\alpha \beta}=\gamma F_{10}+\beta \gamma F_{13}=\gamma\left(E_{x}-\beta B_{y}\right), \\ & \text { etc. } \end{align*}E¯=E¯z=F30=Λα3Λβ0Fαβ=γ2F30+β2γ2F03(3.24)=(1β2)γ2F30=F30=Ez=E,E¯x=F10=Λα1Λβ0Fαβ=γF10+βγF13=γ(ExβBy), etc. 
By contrast, the 3 + 1 3 + 1 3+13+13+1 view shows much more work. A standard approach is based on the Lorentz force law and energy-change law ( 3.2 a , b 3.2 a , b 3.2a,b3.2 \mathrm{a}, \mathrm{b}3.2a,b ), written in the slightly modified form
(3.25) m d 2 x ¯ d τ 2 = e ( E ¯ x d t ¯ d τ + 0 d x ¯ d τ + B ¯ z d y ¯ d τ B ¯ y d z ¯ d τ ) (3.25) m d 2 x ¯ d τ 2 = e E ¯ x d t ¯ d τ + 0 d x ¯ d τ + B ¯ z d y ¯ d τ B ¯ y d z ¯ d τ {:(3.25)m(d^(2)( bar(x)))/(dtau^(2))=e( bar(E)_(x)(d( bar(t)))/(d tau)+0(d( bar(x)))/(d tau)+ bar(B)_(z)(d( bar(y)))/(d tau)- bar(B)_(y)(d( bar(z)))/(d tau)):}\begin{equation*} m \frac{d^{2} \bar{x}}{d \tau^{2}}=e\left(\bar{E}_{x} \frac{d \bar{t}}{d \tau}+0 \frac{d \bar{x}}{d \tau}+\bar{B}_{z} \frac{d \bar{y}}{d \tau}-\bar{B}_{y} \frac{d \bar{z}}{d \tau}\right) \tag{3.25} \end{equation*}(3.25)md2x¯dτ2=e(E¯xdt¯dτ+0dx¯dτ+B¯zdy¯dτB¯ydz¯dτ)
... (three additional equations) ....
It proceeds as follows (details omitted because of their great length!):
(1) Substitute for the d 2 x ¯ / d τ 2 d 2 x ¯ / d τ 2 d^(2) bar(x)//dtau^(2)d^{2} \bar{x} / d \tau^{2}d2x¯/dτ2, etc., the expression for these quantities in terms of the d 2 x / d τ 2 , d 2 x / d τ 2 , d^(2)x//dtau^(2),dotsd^{2} x / d \tau^{2}, \ldotsd2x/dτ2, (Lorentz transformation).
(2) Substitute for the d 2 x / d τ 2 , d 2 x / d τ 2 , d^(2)x//dtau^(2),dotsd^{2} x / d \tau^{2}, \ldotsd2x/dτ2, the expression for these accelerations in terms of the laboratory E E E\boldsymbol{E}E and B B B\boldsymbol{B}B (Lorentz force law).
(3) In these expressions, wherever the components d x / d τ d x / d τ dx//d taud x / d \taudx/dτ of the 4 -velocity in the laboratory frame appear, substitute expressions in terms of the 4 -velocities in the rocket frame (inverse Lorentz transformation).
(4) In (3.25) as thus transformed, demand equality of left and right sides for all values of the d x ¯ / d τ d x ¯ / d τ d bar(x)//d taud \bar{x} / d \taudx¯/dτ, etc. (validity for all test particles).
(5) In this way arrive at the expressions (3.23) for the E E ¯ bar(E)\overline{\boldsymbol{E}}E and B B ¯ bar(B)\overline{\boldsymbol{B}}B in terms of the E E E\boldsymbol{E}E and B B B\boldsymbol{B}B.
The contrast in difficulty is obvious!

§3.4. MAXVELL'S EQUATIONS

Turn now from the action of the field on a charge, and ask about the action of a charge on the field, or, more generally, ask about the dynamics of the electromagnetic
Magnetodynamics derived from magnetostatics
Magnetodynamics and magnetostatics unified in one geometric law
field, charge or no charge. Begin with the simplest of Maxwell's equations in a specific Lorentz frame, the one that says there are no free magnetic poles:
(3.26) B div B = B x x + B y y + B z z = 0 . (3.26) B div B = B x x + B y y + B z z = 0 . {:(3.26)grad*B-=div B=(delB_(x))/(del x)+(delB_(y))/(del y)+(delB_(z))/(del z)=0.:}\begin{equation*} \boldsymbol{\nabla} \cdot \boldsymbol{B} \equiv \operatorname{div} \boldsymbol{B}=\frac{\partial B_{x}}{\partial x}+\frac{\partial B_{y}}{\partial y}+\frac{\partial B_{z}}{\partial z}=0 . \tag{3.26} \end{equation*}(3.26)BdivB=Bxx+Byy+Bzz=0.
This statement has to be true in all Lorentz frames. It is therefore true in the rocket frame:
(3.27) B ¯ x x ¯ + B ¯ y y ¯ + B ¯ z z ¯ = 0 . (3.27) B ¯ x x ¯ + B ¯ y y ¯ + B ¯ z z ¯ = 0 . {:(3.27)(del bar(B)_(x))/(del( bar(x)))+(del bar(B)_(y))/(del( bar(y)))+(del bar(B)_(z))/(del( bar(z)))=0.:}\begin{equation*} \frac{\partial \bar{B}_{x}}{\partial \bar{x}}+\frac{\partial \bar{B}_{y}}{\partial \bar{y}}+\frac{\partial \bar{B}_{z}}{\partial \bar{z}}=0 . \tag{3.27} \end{equation*}(3.27)B¯xx¯+B¯yy¯+B¯zz¯=0.
For an infinitesimal Lorentz transformation in the x x xxx-direction (nonrelativistic velocity β β beta\betaβ ), one has (see Box 2.4 and equations 3.23)
(3.28) B ¯ x = B x , B ¯ y = B y + β E z , B ¯ z = B z β E y (3.29) x ¯ = x + β t , y ¯ = y , z ¯ = z (3.28) B ¯ x = B x , B ¯ y = B y + β E z , B ¯ z = B z β E y (3.29) x ¯ = x + β t , y ¯ = y , z ¯ = z {:[(3.28) bar(B)_(x)=B_(x)","quad bar(B)_(y)=B_(y)+betaE_(z)","quad bar(B)_(z)=B_(z)-betaE_(y)],[(3.29)(del)/(del( bar(x)))=(del)/(del x)+beta(del)/(del t)","quad(del)/(del( bar(y)))=(del)/(del y)","quad(del)/(del( bar(z)))=(del)/(del z)]:}\begin{gather*} \bar{B}_{x}=B_{x}, \quad \bar{B}_{y}=B_{y}+\beta E_{z}, \quad \bar{B}_{z}=B_{z}-\beta E_{y} \tag{3.28}\\ \frac{\partial}{\partial \bar{x}}=\frac{\partial}{\partial x}+\beta \frac{\partial}{\partial t}, \quad \frac{\partial}{\partial \bar{y}}=\frac{\partial}{\partial y}, \quad \frac{\partial}{\partial \bar{z}}=\frac{\partial}{\partial z} \tag{3.29} \end{gather*}(3.28)B¯x=Bx,B¯y=By+βEz,B¯z=BzβEy(3.29)x¯=x+βt,y¯=y,z¯=z
Substitute into the condition of zero divergence in the rocket frame. Recover the original condition of zero divergence in the laboratory frame, plus the following additional information (requirement for the vanishing of the coefficient of the arbitrary small velocity β β beta\betaβ ):
(3.30) B x t + E z y E y z = 0 . (3.30) B x t + E z y E y z = 0 . {:(3.30)(delB_(x))/(del t)+(delE_(z))/(del y)-(delE_(y))/(del z)=0.:}\begin{equation*} \frac{\partial B_{x}}{\partial t}+\frac{\partial E_{z}}{\partial y}-\frac{\partial E_{y}}{\partial z}=0 . \tag{3.30} \end{equation*}(3.30)Bxt+EzyEyz=0.
Had the velocity of transformation been directed in the y y yyy - or z z zzz-directions, a similar equation would have been obtained for B y / t B y / t delB_(y)//del t\partial B_{y} / \partial tBy/t or B z / t B z / t delB_(z)//del t\partial B_{z} / \partial tBz/t. In the language of threedimensional vectors, these three equations reduce to the one equation
(3.31) B t + × E B t + curl E = 0 (3.31) B t + × E B t + curl E = 0 {:(3.31)(del B)/(del t)+grad xx E-=(del B)/(del t)+curl E=0:}\begin{equation*} \frac{\partial \boldsymbol{B}}{\partial t}+\boldsymbol{\nabla} \times \boldsymbol{E} \equiv \frac{\partial \boldsymbol{B}}{\partial t}+\operatorname{curl} \boldsymbol{E}=0 \tag{3.31} \end{equation*}(3.31)Bt+×EBt+curlE=0
How beautiful that (1) the principle of covariance (laws of physics are the same in every Lorentz reference system, which is equivalent to the geometric view of physics) plus (2) the principle that magnetic tubes of force never end, gives (3) Maxwell's dynamic law for the time-rate of change of the magnetic field! This suggests that the magnetostatic law B = 0 B = 0 grad*B=0\boldsymbol{\nabla} \cdot \boldsymbol{B}=0B=0 and the magnetodynamic law B / t + × E = 0 B / t + × E = 0 del B//del t+grad xx E=0\partial \boldsymbol{B} / \partial t+\boldsymbol{\nabla} \times \boldsymbol{E}=0B/t+×E=0 must be wrapped up together in a single frame-independent, geometric law. In terms of components of the field tensor F F F\boldsymbol{F}F, that geometric law must read
(3.32) F α β , γ + F β γ , α + F γ α , β = 0 , (3.32) F α β , γ + F β γ , α + F γ α , β = 0 , {:(3.32)F_(alpha beta,gamma)+F_(beta gamma,alpha)+F_(gamma alpha,beta)=0",":}\begin{equation*} F_{\alpha \beta, \gamma}+F_{\beta \gamma, \alpha}+F_{\gamma \alpha, \beta}=0, \tag{3.32} \end{equation*}(3.32)Fαβ,γ+Fβγ,α+Fγα,β=0,
since this reduces to B = 0 B = 0 grad*B=0\boldsymbol{\nabla} \cdot \boldsymbol{B}=0B=0 when one takes α = 1 , β = 2 , γ = 3 α = 1 , β = 2 , γ = 3 alpha=1,beta=2,gamma=3\alpha=1, \beta=2, \gamma=3α=1,β=2,γ=3; and it reduces to B / t + × E = 0 B / t + × E = 0 del B//del t+grad xx E=0\partial \boldsymbol{B} / \partial t+\boldsymbol{\nabla} \times \boldsymbol{E}=0B/t+×E=0 when one sets any index, e.g., α α alpha\alphaα, equal to zero (see exercise 3.7 below). In frame-independent geometric language, this law is written (see § 3.5 § 3.5 §3.5\S 3.5§3.5, exercise 3.14, and Chapter 4 for notation)
(3.33) d F = 0 , or, equivalently, F = 0 ; (3.33) d F = 0 , or, equivalently,  F = 0 ; {:(3.33)dF=0", or, equivalently, "grad***F=0;:}\begin{equation*} \boldsymbol{d} \boldsymbol{F}=0 \text {, or, equivalently, } \boldsymbol{\nabla} \cdot \boldsymbol{*} \boldsymbol{F}=0 ; \tag{3.33} \end{equation*}(3.33)dF=0, or, equivalently, F=0;
and it says, "Take the electromagnetic 2 -form F F F\boldsymbol{F}F (a geometric object defined even in absence of coordinates); from it construct a new geometric object d F d F dF\boldsymbol{d F}dF (called the "exterior derivative of F F F\boldsymbol{F}F "); d F d F dF\boldsymbol{d} \boldsymbol{F}dF must vanish. The details of this coordinate-free process will be spelled out in exercise 3.15 and in § 4.5 § 4.5 §4.5\S 4.5§4.5 (track 2).
Two of Maxwell's equations remain: the electrostatic equation
(3.34) E = 4 π ρ , (3.34) E = 4 π ρ , {:(3.34)grad*E=4pi rho",":}\begin{equation*} \boldsymbol{\nabla} \cdot \boldsymbol{E}=4 \pi \rho, \tag{3.34} \end{equation*}(3.34)E=4πρ,
and the electrodynamic equation
(3.35) E / t × B = 4 π J . (3.35) E / t × B = 4 π J . {:(3.35)del E//del t-grad xx B=-4pi J.:}\begin{equation*} \partial \boldsymbol{E} / \partial t-\boldsymbol{\nabla} \times \boldsymbol{B}=-4 \pi J . \tag{3.35} \end{equation*}(3.35)E/t×B=4πJ.
They, like the magnetostatic and magnetodynamic equations, are actually two different parts of a single geometric law. Written in terms of field components, that law says
(3.36) F , β α β = 4 π J α , (3.36) F , β α β = 4 π J α , {:(3.36)F_(,beta)^(alpha beta)=4piJ^(alpha)",":}\begin{equation*} F_{, \beta}^{\alpha \beta}=4 \pi J^{\alpha}, \tag{3.36} \end{equation*}(3.36)F,βαβ=4πJα,
where the components of the "4-current" J J J\boldsymbol{J}J are
J 0 = ρ = charge density (3.37) ( J 1 , J 2 , J 3 ) = components of current density. J 0 = ρ =  charge density  (3.37) J 1 , J 2 , J 3 =  components of current density.  {:[J^(0)=rho=" charge density "],[(3.37)(J^(1),J^(2),J^(3))=" components of current density. "]:}\begin{align*} J^{0}=\rho & =\text { charge density } \\ \left(J^{1}, J^{2}, J^{3}\right) & =\text { components of current density. } \tag{3.37} \end{align*}J0=ρ= charge density (3.37)(J1,J2,J3)= components of current density. 
Written in coordinate-free, geometric language, this electrodynamic law says
(3.38) d F = 4 π J or, equivalently, F = 4 π J (3.38) d F = 4 π J  or, equivalently,  F = 4 π J {:(3.38)d^(**)F=4pi^(**)J" or, equivalently, "grad*F=4pi J:}\begin{equation*} \boldsymbol{d}^{*} \boldsymbol{F}=4 \pi^{*} \boldsymbol{J} \text { or, equivalently, } \boldsymbol{\nabla} \cdot \boldsymbol{F}=4 \pi \boldsymbol{J} \tag{3.38} \end{equation*}(3.38)dF=4πJ or, equivalently, F=4πJ
(For full discussion, see exercise 3.15 and § 4.5 § 4.5 §4.5\S 4.5§4.5, which is on Track 2.)

Exercise 3.7. MAXWELL'S EQUATIONS

EXERCISE

Show, by explicit examination of components, that the geometric laws
F α β , γ + F β γ , α + F γ α , β = 0 , F α β , β = 4 π J α , F α β , γ + F β γ , α + F γ α , β = 0 , F α β , β = 4 π J α , F_(alpha beta,gamma)+F_(beta gamma,alpha)+F_(gamma alpha,beta)=0,quadF^(alpha beta)_(,beta)=4piJ^(alpha),F_{\alpha \beta, \gamma}+F_{\beta \gamma, \alpha}+F_{\gamma \alpha, \beta}=0, \quad F^{\alpha \beta}{ }_{, \beta}=4 \pi J^{\alpha},Fαβ,γ+Fβγ,α+Fγα,β=0,Fαβ,β=4πJα,
do reduce to Maxwell's equations (3.26), (3.31), (3.34), (3.35), as claimed above.

§3.5 WORKING WITH TENSORS

Another mathematical digression is needed. Given an arbitrary tensor field, s s s\boldsymbol{s}s, of arbitrary rank (choose rank = 3 = 3 =3=3=3 for concreteness), one can construct new tensor fields by a variety of operations.
One operation is the gradient grad\mathbf{\nabla}. (The symbol d d d\boldsymbol{d}d is reserved for gradients of scalars,
Ways to produce new tensors from old:
Gradient
in which case f d f f d f grad f-=df\boldsymbol{\nabla} f \equiv \boldsymbol{d} ffdf, and for "exterior derivatives of differential forms;" a Track-2
Electrodynamics and electrostatics unified in one geometric law
concept, on which see §4.5.) Like S , S S , S S,grad S\boldsymbol{S}, \boldsymbol{\nabla} \boldsymbol{S}S,S is a machine. It has four slots, whereas S S S\boldsymbol{S}S has three. It describes how S S S\boldsymbol{S}S changes from point to point. Specifically, if one desires to know how the number S ( u , v , w ) S ( u , v , w ) S(u,v,w)\boldsymbol{S}(\boldsymbol{u}, \boldsymbol{v}, \boldsymbol{w})S(u,v,w) for fixed u , v , w u , v , w u,v,w\boldsymbol{u}, \boldsymbol{v}, \boldsymbol{w}u,v,w changes under a displacement ξ ξ xi\boldsymbol{\xi}ξ, one inserts u , v , w , ξ u , v , w , ξ u,v,w,xi\boldsymbol{u}, \boldsymbol{v}, \boldsymbol{w}, \boldsymbol{\xi}u,v,w,ξ into the four slots of S S grad S\boldsymbol{\nabla} \boldsymbol{S}S :
S ( u , v , w , ξ ) ξ S ( u , v , w ) with u , v , w fixed (3.39) + [ value of S ( u , v , w ) at tip of ξ ] [ value of S ( u , v , w ) at tail of ξ ] . S ( u , v , w , ξ ) ξ S ( u , v , w )  with  u , v , w  fixed  (3.39) + [  value of  S ( u , v , w )  at tip of  ξ ] [  value of  S ( u , v , w )  at tail of  ξ ] . {:[grad S(u","v","w","xi)-=del_(xi)S(u","v","w)" with "u","v","w" fixed "],[(3.39)≃+[" value of "S(u","v","w)" at tip of "xi]],[-[" value of "S(u","v","w)" at tail of "xi].]:}\begin{align*} \boldsymbol{\nabla} \boldsymbol{S}(\boldsymbol{u}, \boldsymbol{v}, \boldsymbol{w}, \boldsymbol{\xi}) \equiv & \partial_{\xi} \boldsymbol{S}(\boldsymbol{u}, \boldsymbol{v}, \boldsymbol{w}) \text { with } \boldsymbol{u}, \boldsymbol{v}, \boldsymbol{w} \text { fixed } \\ \simeq & +[\text { value of } \boldsymbol{S}(\boldsymbol{u}, \boldsymbol{v}, \boldsymbol{w}) \text { at tip of } \xi] \tag{3.39}\\ & -[\text { value of } \boldsymbol{S}(\boldsymbol{u}, \boldsymbol{v}, \boldsymbol{w}) \text { at tail of } \xi] . \end{align*}S(u,v,w,ξ)ξS(u,v,w) with u,v,w fixed (3.39)+[ value of S(u,v,w) at tip of ξ][ value of S(u,v,w) at tail of ξ].
In component notation in a Lorentz frame, this says
S ( u , v , w , ξ ) ξ ( S α β γ u α v β w γ ) = ( S α β γ x δ ξ δ ) u α v β w γ = S α β γ , δ u α v β w γ ξ δ . S ( u , v , w , ξ ) ξ S α β γ u α v β w γ = S α β γ x δ ξ δ u α v β w γ = S α β γ , δ u α v β w γ ξ δ . {:[grad S(u","v","w","xi)-=del_(xi)(S_(alpha beta gamma)u^(alpha)v^(beta)w^(gamma))=((delS_(alpha beta gamma))/(delx^(delta))xi^(delta))u^(alpha)v^(beta)w^(gamma)],[=S_(alpha beta gamma,delta)u^(alpha)v^(beta)w^(gamma)xi^(delta).]:}\begin{aligned} \boldsymbol{\nabla} \boldsymbol{S}(\boldsymbol{u}, \boldsymbol{v}, \boldsymbol{w}, \xi) & \equiv \partial_{\boldsymbol{\xi}}\left(S_{\alpha \beta \gamma} u^{\alpha} v^{\beta} w^{\gamma}\right)=\left(\frac{\partial S_{\alpha \beta \gamma}}{\partial x^{\delta}} \xi^{\delta}\right) u^{\alpha} v^{\beta} w^{\gamma} \\ & =S_{\alpha \beta \gamma, \delta} u^{\alpha} v^{\beta} w^{\gamma} \xi^{\delta} . \end{aligned}S(u,v,w,ξ)ξ(Sαβγuαvβwγ)=(Sαβγxδξδ)uαvβwγ=Sαβγ,δuαvβwγξδ.
Thus, the Lorentz-frame components of S S grad S\boldsymbol{\nabla} \boldsymbol{S}S are nothing but the partial derivatives of the components of S S S\boldsymbol{S}S. Notice that the gradient raises the rank of a tensor by 1 (from 3 to 4 for S S S\boldsymbol{S}S ).
Contraction is another process that produces a new tensor from an old one. It seals off ("contracts") two of the old tensor's slots, thereby reducing the rank by two. Specifically, if R R R\boldsymbol{R}R is a fourth-rank tensor and M M M\boldsymbol{M}M is obtained by contracting the first and third slots of R R R\boldsymbol{R}R, then the output of M M M\boldsymbol{M}M is given by (definition!)
(3.40) M ( u , v ) = α = 0 3 R ( e α , u , ω α , v ) (3.40) M ( u , v ) = α = 0 3 R e α , u , ω α , v {:(3.40)M(u","v)=sum_(alpha=0)^(3)R(e_(alpha),u,omega^(alpha),v):}\begin{equation*} \boldsymbol{M}(\boldsymbol{u}, \boldsymbol{v})=\sum_{\alpha=0}^{3} \boldsymbol{R}\left(\boldsymbol{e}_{\alpha}, \boldsymbol{u}, \boldsymbol{\omega}^{\alpha}, \boldsymbol{v}\right) \tag{3.40} \end{equation*}(3.40)M(u,v)=α=03R(eα,u,ωα,v)
Here e α e α e_(alpha)\boldsymbol{e}_{\alpha}eα and ω α ω α omega^(alpha)\boldsymbol{\omega}^{\alpha}ωα are the basis vectors and 1-forms of a specific but arbitrary Lorentz coordinate frame. It makes no difference which frame is chosen; the result will always be the same (exercise 3.8 below). In terms of components in any Lorentz frame, equation (3.40) says (exercise 3.8)
M ( u , v ) = M μ ν u μ v ν = R α μ α α ν u μ v ν M ( u , v ) = M μ ν u μ v ν = R α μ α α ν u μ v ν M(u,v)=M_(mu nu)u^(mu)v^(nu)=R_(alpha mu)_(alpha)^(alpha)_(nu)u^(mu)v^(nu)\boldsymbol{M}(\boldsymbol{u}, \boldsymbol{v})=M_{\mu \nu} u^{\mu} v^{\nu}=R_{\alpha \mu}{ }_{\alpha}^{\alpha}{ }_{\nu} u^{\mu} v^{\nu}M(u,v)=Mμνuμvν=Rαμαανuμvν
so that
(3.41) M μ ν = R α μ α p . (3.41) M μ ν = R α μ α p . {:(3.41)M_(mu nu)=R_(alpha mu)^(alpha)_(p).:}\begin{equation*} M_{\mu \nu}=R_{\alpha \mu}{ }^{\alpha}{ }_{p} . \tag{3.41} \end{equation*}(3.41)Mμν=Rαμαp.
Thus, in terms of components, contraction amounts to putting one index up and the other down, and then summing on them.
Divergence is a third process for creating new tensors from old. It is accomplished by taking the gradient, then contracting the gradient's slot with one of the original slots:
(divergence of S S S\boldsymbol{S}S on first slot) S S -=grad*S\equiv \boldsymbol{\nabla} \cdot \boldsymbol{S}S is a machine such that
(3.42) S ( u , v ) = S ( ω α , u , v , e α ) = S α β γ , α u β v γ ; (3.42) S ( u , v ) = S ω α , u , v , e α = S α β γ , α u β v γ ; {:(3.42)grad*S(u","v)=grad S(omega^(alpha),u,v,e_(alpha))=S^(alpha)_(beta gamma,alpha)u^(beta)v^(gamma);:}\begin{equation*} \boldsymbol{\nabla} \cdot \boldsymbol{S}(\boldsymbol{u}, \boldsymbol{v})=\boldsymbol{\nabla} \boldsymbol{S}\left(\boldsymbol{\omega}^{\alpha}, \boldsymbol{u}, \boldsymbol{v}, \boldsymbol{e}_{\alpha}\right)=S^{\alpha}{ }_{\beta \gamma, \alpha} u^{\beta} v^{\gamma} ; \tag{3.42} \end{equation*}(3.42)S(u,v)=S(ωα,u,v,eα)=Sαβγ,αuβvγ;
i.e. S S grad*S\boldsymbol{\nabla} \cdot \boldsymbol{S}S has components S α β γ , α S α β γ , α S^(alpha)_(beta gamma,alpha)S^{\alpha}{ }_{\beta \gamma, \alpha}Sαβγ,α.
Transpose is a fourth, rather trivial process for creating new tensors. It merely interchanges two slots:
N N N\boldsymbol{N}N obtained by transposing second and third slots of S S S=>\boldsymbol{S} \RightarrowS
(3.43) N ( u , v , w ) = S ( u , w , v ) (3.43) N ( u , v , w ) = S ( u , w , v ) {:(3.43)N(u","v","w)=S(u","w","v):}\begin{equation*} \boldsymbol{N}(u, \boldsymbol{v}, \boldsymbol{w})=\boldsymbol{S}(\boldsymbol{u}, \boldsymbol{w}, \boldsymbol{v}) \tag{3.43} \end{equation*}(3.43)N(u,v,w)=S(u,w,v)
Symmetrization and antisymmetrization are fifth and sixth processes for producing new tensors from old. A tensor is completely symmetric if its output is unaffected by an interchange of two input vectors or 1 -forms:
(3.44a) S ( u , v , w ) = S ( v , u , w ) = S ( v , w , u ) = (3.44a) S ( u , v , w ) = S ( v , u , w ) = S ( v , w , u ) = {:(3.44a)S(u","v","w)=S(v","u","w)=S(v","w","u)=cdots:}\begin{equation*} \boldsymbol{S}(u, v, w)=\boldsymbol{S}(v, u, w)=\boldsymbol{S}(v, w, u)=\cdots \tag{3.44a} \end{equation*}(3.44a)S(u,v,w)=S(v,u,w)=S(v,w,u)=
It is completely antisymmetric if it reverses sign on each interchange of input
(3.44b) S ( u , v , w ) = S ( v , u , w ) = + S ( v , w , u ) = (3.44b) S ( u , v , w ) = S ( v , u , w ) = + S ( v , w , u ) = {:(3.44b)S(u","v","w)=-S(v","u","w)=+S(v","w","u)=cdots:}\begin{equation*} \boldsymbol{S}(u, \boldsymbol{v}, \boldsymbol{w})=-\boldsymbol{S}(\boldsymbol{v}, \boldsymbol{u}, \boldsymbol{w})=+\boldsymbol{S}(\boldsymbol{v}, \boldsymbol{w}, \boldsymbol{u})=\cdots \tag{3.44b} \end{equation*}(3.44b)S(u,v,w)=S(v,u,w)=+S(v,w,u)=
Any tensor can be symmetrized or antisymmetrized by constructing an appropriate linear combination of it and its transposes; see exercise 3.12 .
Wedge product is a seventh process for producing new tensors from old. It is merely an antisymmetrized tensor product: given two vectors u u u\boldsymbol{u}u and v v v\boldsymbol{v}v, their wedge product, the "bivector" u v u v u^^v\boldsymbol{u} \wedge \boldsymbol{v}uv, is defined by
(3.45a) u v u v v u (3.45a) u v u v v u {:(3.45a)u^^v-=u ox v-v ox u:}\begin{equation*} \boldsymbol{u} \wedge \boldsymbol{v} \equiv \boldsymbol{u} \otimes v-v \otimes \boldsymbol{u} \tag{3.45a} \end{equation*}(3.45a)uvuvvu
similarly, the "2-form" α β α β alpha^^beta\alpha \wedge \betaαβ constructed from two 1-forms is
(3.45b) α β α β β α (3.45b) α β α β β α {:(3.45b)alpha^^beta-=alpha ox beta-beta ox alpha:}\begin{equation*} \alpha \wedge \beta \equiv \alpha \otimes \beta-\beta \otimes \alpha \tag{3.45b} \end{equation*}(3.45b)αβαββα
From three vectors u , v , w u , v , w u,v,w\boldsymbol{u}, \boldsymbol{v}, \boldsymbol{w}u,v,w one constructs the "trivector"
u v w ( u v ) w u ( v w ) = u v w + terms that guarantee complete antisymmetry (3.45c) = u v w + v w u + w u v v u w u w v w v u u v w ( u v ) w u ( v w ) = u v w +  terms that guarantee complete antisymmetry  (3.45c) = u v w + v w u + w u v v u w u w v w v u {:[u^^v^^w-=(u^^v)^^w-=u^^(v^^w)],[=u ox v ox w+" terms that guarantee complete antisymmetry "],[(3.45c)=u ox v ox w+v ox w ox u+w ox u ox v],[-v ox u ox w-u ox w ox v-w ox v ox u]:}\begin{align*} \boldsymbol{u} \wedge \boldsymbol{v} \wedge \boldsymbol{w} \equiv & (\boldsymbol{u} \wedge \boldsymbol{v}) \wedge \boldsymbol{w} \equiv \boldsymbol{u} \wedge(\boldsymbol{v} \wedge \boldsymbol{w}) \\ = & \boldsymbol{u} \otimes \boldsymbol{v} \otimes \boldsymbol{w}+\text { terms that guarantee complete antisymmetry } \\ = & \boldsymbol{u} \otimes \boldsymbol{v} \otimes \boldsymbol{w}+\boldsymbol{v} \otimes \boldsymbol{w} \otimes \boldsymbol{u}+\boldsymbol{w} \otimes \boldsymbol{u} \otimes \boldsymbol{v} \tag{3.45c}\\ & -\boldsymbol{v} \otimes \boldsymbol{u} \otimes \boldsymbol{w}-\boldsymbol{u} \otimes \boldsymbol{w} \otimes \boldsymbol{v}-\boldsymbol{w} \otimes \boldsymbol{v} \otimes \boldsymbol{u} \end{align*}uvw(uv)wu(vw)=uvw+ terms that guarantee complete antisymmetry (3.45c)=uvw+vwu+wuvvuwuwvwvu
From 1-forms α , β , γ α , β , γ alpha,beta,gamma\boldsymbol{\alpha}, \boldsymbol{\beta}, \boldsymbol{\gamma}α,β,γ one similarly constructs the "3-forms" α β γ α β γ alpha^^beta^^gamma\boldsymbol{\alpha} \wedge \boldsymbol{\beta} \wedge \boldsymbol{\gamma}αβγ. The wedge product gives a simple way to test for coplanarity (linear dependence) of vectors: if u u u\boldsymbol{u}u and v v v\boldsymbol{v}v are collinear, so u = a v u = a v u=av\boldsymbol{u}=a \boldsymbol{v}u=av, then
u v = a v v = 0 ( by antisymmetry of " " ) u v = a v v = 0 (  by antisymmetry of "  " ) u^^v=av^^v=0quad(" by antisymmetry of " "^^")\boldsymbol{u} \wedge \boldsymbol{v}=a \boldsymbol{v} \wedge \boldsymbol{v}=0 \quad(\text { by antisymmetry of " } \wedge ")uv=avv=0( by antisymmetry of " ")
If w w w\boldsymbol{w}w is coplanar with u u u\boldsymbol{u}u and v v v\boldsymbol{v}v so w = a u + b v w = a u + b v w=au+bv\boldsymbol{w}=a \boldsymbol{u}+b \boldsymbol{v}w=au+bv ("collapsed box"), then
w u v = a u u v + b v u v = 0 w u v = a u u v + b v u v = 0 w^^u^^v=au^^u^^v+bv^^u^^v=0\boldsymbol{w} \wedge \boldsymbol{u} \wedge \boldsymbol{v}=a \boldsymbol{u} \wedge \boldsymbol{u} \wedge \boldsymbol{v}+b \boldsymbol{v} \wedge \boldsymbol{u} \wedge \boldsymbol{v}=0wuv=auuv+bvuv=0
The symbol " ^^\wedge " is called a "hat" or "wedge" or "exterior product sign." Its properties are investigated in Chapter 4.
Taking the dual is an eighth process for constructing new tensors. It plays a Dual fundamental role in Track 2 of this book, but since it is not needed for Track 1, its definition and properties are treated only in the exercises ( 3.14 and 3.15 ).
Symmetrization and antisymmetrization
Wedge product
Bivector
2-form
Trivector
Because the frame-independent geometric notation is somewhat ambiguous (which slots are being contracted? on which slot is the divergence taken? which slots are being transposed?), one often uses component notation to express coordinate-independent, geometric relations between geometric objects. For example,
J β γ = S β γ , α α J β γ = S β γ , α α J_(beta gamma)=S_(beta gamma,alpha)^(alpha)J_{\beta \gamma}=S_{\beta \gamma, \alpha}^{\alpha}Jβγ=Sβγ,αα
means " J J J\boldsymbol{J}J is a tensor obtained by taking the divergence on the first slot of the tensor S S S^('')S^{\prime \prime}S. Also,
v γ = ( F μ ν F μ ν ) , γ ( F μ ν F μ ν ) , β η β γ v γ = F μ ν F μ ν , γ F μ ν F μ ν , β η β γ v^(gamma)=(F_(mu nu)F^(mu nu))^(,gamma)-=(F_(mu nu)F^(mu nu))_(,beta)eta^(beta gamma)v^{\gamma}=\left(F_{\mu \nu} F^{\mu \nu}\right)^{, \gamma} \equiv\left(F_{\mu \nu} F^{\mu \nu}\right)_{, \beta} \eta^{\beta \gamma}vγ=(FμνFμν),γ(FμνFμν),βηβγ
means " v v v\boldsymbol{v}v is a vector obtained by (1) constructing the tensor product F F F F F ox F\boldsymbol{F} \otimes \boldsymbol{F}FF of F F F\boldsymbol{F}F with itself, (2) contracting F F F F F ox F\boldsymbol{F} \otimes \boldsymbol{F}FF on its first and third slots, and also on its second and fourth, (3) taking the gradient of the resultant scalar function, (4) converting that gradient, which is a 1 -form, into the corresponding vector."
Index gymnastics
"Index gymnastics," the technique of extracting the content from geometric equations by working in component notation and rearranging indices as required, must be mastered if one wishes to do difficult calculations in relativity, special or general. Box 3.3 expounds some of the short cuts in index gymnastics, and exercises 3.8-3.18 offer practice.

EXERCISES

Exercise 3.8. CONTRACTION IS FRAME-INDEPENDENT

Show that contraction, as defined in equation (3.40), does not depend on which Lorentz frame e α e α e_(alpha)\boldsymbol{e}_{\alpha}eα and ω α ω α omega^(alpha)\boldsymbol{\omega}^{\alpha}ωα are taken from. Also show that equation (3.40) implies
M ( u , v ) = R α μ α ν u μ v v M ( u , v ) = R α μ α ν u μ v v M(u,v)=R_(alpha mu)^(alpha)_(nu)u^(mu)v^(v)\boldsymbol{M}(\boldsymbol{u}, \boldsymbol{v})=R_{\alpha \mu}{ }^{\alpha}{ }_{\nu} u^{\mu} v^{v}M(u,v)=Rαμανuμvv

Exercise 3.9. DIFFERENTIATION

(a) Justify the formula
d ( u μ v v ) / d τ = ( d u μ / d τ ) v v + u μ ( d v v / d τ ) d u μ v v / d τ = d u μ / d τ v v + u μ d v v / d τ d(u_(mu)v^(v))//d tau=(du_(mu)//d tau)v^(v)+u_(mu)(dv^(v)//d tau)d\left(u_{\mu} v^{v}\right) / d \tau=\left(d u_{\mu} / d \tau\right) v^{v}+u_{\mu}\left(d v^{v} / d \tau\right)d(uμvv)/dτ=(duμ/dτ)vv+uμ(dvv/dτ)
by considering the special case μ = 0 , ν = 1 μ = 0 , ν = 1 mu=0,nu=1\mu=0, \nu=1μ=0,ν=1.
(b) Explain why
( T α β v β ) , μ = T , μ α β v β + T α β v β , μ T α β v β , μ = T , μ α β v β + T α β v β , μ (T^(alpha beta)v_(beta))_(,mu)=T_(,mu)^(alpha beta)v_(beta)+T^(alpha beta)v_(beta,mu)\left(T^{\alpha \beta} v_{\beta}\right)_{, \mu}=T_{, \mu}^{\alpha \beta} v_{\beta}+T^{\alpha \beta} v_{\beta, \mu}(Tαβvβ),μ=T,μαβvβ+Tαβvβ,μ

Exercise 3.10. MORE DIFFERENTIATION

(a) Justify the formula,
d ( u μ u μ ) / d τ = 2 u μ ( d u μ / d τ ) d u μ u μ / d τ = 2 u μ d u μ / d τ d(u^(mu)u_(mu))//d tau=2u_(mu)(du^(mu)//d tau)d\left(u^{\mu} u_{\mu}\right) / d \tau=2 u_{\mu}\left(d u^{\mu} / d \tau\right)d(uμuμ)/dτ=2uμ(duμ/dτ)
by writing out the summation u μ u μ η μ ν u μ u ν u μ u μ η μ ν u μ u ν u^(mu)u_(mu)-=eta_(mu nu)u^(mu)u^(nu)u^{\mu} u_{\mu} \equiv \eta_{\mu \nu} u^{\mu} u^{\nu}uμuμημνuμuν explicitly.
(b) Let δ δ delta\deltaδ indicate a variation or small change, and justify the formula
δ ( F α β F α β ) = 2 F α β δ F α β . δ F α β F α β = 2 F α β δ F α β . delta(F_(alpha beta)F^(alpha beta))=2F_(alpha beta)deltaF^(alpha beta).\delta\left(F_{\alpha \beta} F^{\alpha \beta}\right)=2 F_{\alpha \beta} \delta F^{\alpha \beta} .δ(FαβFαβ)=2FαβδFαβ.
(c) Compute ( F α β F α β ) , μ = F α β F α β , μ = (F_(alpha beta)F^(alpha beta))_(,mu)=\left(F_{\alpha \beta} F^{\alpha \beta}\right)_{, \mu}=(FαβFαβ),μ= ?

Box 3.3 TECHNIQUES OF INDEX GYMNASTICS

Equation

Name and Discussion
S α β γ = S ( ω α , e β , e γ ) S α β γ = S ( ω α , ω β , e γ ) S = S α β γ e α ω β ω γ S = S α β γ e α e β e γ S α β γ = η β μ S α μ γ S α μ γ = η μ β S α β γ M μ = S α μ α T α β μ ν = S α β μ M v A 2 = A α A α η α β η β γ = δ α γ S α β γ = N α β , γ R β = N α β , α N α β , γ = ( η β μ N α μ ) , γ = η β μ N α μ , γ N α β γ N α β , μ η μ γ ( R α M β ) , γ = R α , γ M β + R α M β , γ G α β = F [ α β ] 1 2 ( F α β F β α ) H α β = F ( α β ) 1 2 ( F α β + F β α ) J α β γ = J μ ε μ α β γ F α β = 1 2 F μ v ε μ ν α β B α = 6 1 B λ μ ν ε λ μ ν α S α β γ = S ω α , e β , e γ S α β γ = S ω α , ω β , e γ S = S α β γ e α ω β ω γ S = S α β γ e α e β e γ S α β γ = η β μ S α μ γ S α μ γ = η μ β S α β γ M μ = S α μ α T α β μ ν = S α β μ M v A 2 = A α A α η α β η β γ = δ α γ S α β γ = N α β , γ R β = N α β , α N α β , γ = η β μ N α μ , γ = η β μ N α μ , γ N α β γ N α β , μ η μ γ R α M β , γ = R α , γ M β + R α M β , γ G α β = F [ α β ] 1 2 F α β F β α H α β = F ( α β ) 1 2 F α β + F β α J α β γ = J μ ε μ α β γ F α β = 1 2 F μ v ε μ ν α β B α = 6 1 B λ μ ν ε λ μ ν α {:[S^(alpha)_(beta gamma)=S(omega^(alpha),e_(beta),e_(gamma))],[S^(alpha beta)_(gamma)=S(omega^(alpha),omega^(beta),e_(gamma))],[S=S^(alpha)_(beta gamma)e_(alpha)oxomega^(beta)oxomega^(gamma)],[S=S^(alpha beta gamma)e_(alpha)oxe_(beta)oxe_(gamma)],[S^(alpha beta)_(gamma)=eta^(beta mu)S^(alpha)_(mu gamma)],[S^(alpha)_(mu gamma)=eta_(mu beta)S^(alpha beta)_(gamma)],[M_(mu)=S^(alpha)_(mu alpha)],[T^(alpha beta)_(mu nu)=S^(alpha beta)_(mu)M_(v)],[A^(2)=A^(alpha)A_(alpha)],[eta_(alpha beta)eta^(beta gamma)=delta_(alpha)^(gamma)],[S^(alpha)_(beta gamma)=N^(alpha)_(beta,gamma)],[R_(beta)=N^(alpha)_(beta,alpha)],[N^(alpha)_(beta,gamma)=(eta_(beta mu)N^(alpha mu))_(,gamma)=eta_(beta mu)N^(alpha mu)_(,gamma)],[N^(alpha)_(beta)^(gamma)-=N^(alpha)_(beta,mu)eta^(mu gamma)],[(R_(alpha)M_(beta))_(,gamma)=R_(alpha,gamma)M_(beta)+R_(alpha)M_(beta,gamma)],[G_(alpha beta)=F_([alpha beta])-=(1)/(2)(F_(alpha beta)-F_(beta alpha))],[H_(alpha beta)=F_((alpha beta))-=(1)/(2)(F_(alpha beta)+F_(beta alpha))],[^(**)J_(alpha beta gamma)=J^(mu)epsi_(mu alpha beta gamma)],[^(**)F_(alpha beta)=(1)/(2)F^(mu v)epsi_(mu nu alpha beta)],[^(**)B_(alpha)=_(6)^(1)B^(lambda mu nu)epsi_(lambda mu nu alpha)]:}\begin{aligned} & S^{\alpha}{ }_{\beta \gamma}=S\left(\boldsymbol{\omega}^{\alpha}, \boldsymbol{e}_{\beta}, \boldsymbol{e}_{\gamma}\right) \\ & S^{\alpha \beta}{ }_{\gamma}=S\left(\boldsymbol{\omega}^{\alpha}, \boldsymbol{\omega}^{\beta}, \boldsymbol{e}_{\gamma}\right) \\ & \boldsymbol{S}=S^{\alpha}{ }_{\beta \gamma} \boldsymbol{e}_{\alpha} \otimes \boldsymbol{\omega}^{\beta} \otimes \boldsymbol{\omega}^{\gamma} \\ & \boldsymbol{S}=S^{\alpha \beta \gamma} \boldsymbol{e}_{\alpha} \otimes \boldsymbol{e}_{\beta} \otimes \boldsymbol{e}_{\gamma} \\ & S^{\alpha \beta}{ }_{\gamma}=\eta^{\beta \mu} S^{\alpha}{ }_{\mu \gamma} \\ & S^{\alpha}{ }_{\mu \gamma}=\eta_{\mu \beta} S^{\alpha \beta}{ }_{\gamma} \\ & M_{\mu}=S^{\alpha}{ }_{\mu \alpha} \\ & T^{\alpha \beta}{ }_{\mu \nu}=S^{\alpha \beta}{ }_{\mu} M_{v} \\ & \boldsymbol{A}^{2}=A^{\alpha} A_{\alpha} \\ & \eta_{\alpha \beta} \eta^{\beta \gamma}=\delta_{\alpha}{ }^{\gamma} \\ & S^{\alpha}{ }_{\beta \gamma}=N^{\alpha}{ }_{\beta, \gamma} \\ & R_{\beta}=N^{\alpha}{ }_{\beta, \alpha} \\ & N^{\alpha}{ }_{\beta, \gamma}=\left(\eta_{\beta \mu} N^{\alpha \mu}\right)_{, \gamma}=\eta_{\beta \mu} N^{\alpha \mu}{ }_{, \gamma} \\ & N^{\alpha}{ }_{\beta}{ }^{\gamma} \equiv N^{\alpha}{ }_{\beta, \mu} \eta^{\mu \gamma} \\ & \left(R_{\alpha} M_{\beta}\right)_{, \gamma}=R_{\alpha, \gamma} M_{\beta}+R_{\alpha} M_{\beta, \gamma} \\ & G_{\alpha \beta}=F_{[\alpha \beta]} \equiv \frac{1}{2}\left(F_{\alpha \beta}-F_{\beta \alpha}\right) \\ & H_{\alpha \beta}=F_{(\alpha \beta)} \equiv \frac{1}{2}\left(F_{\alpha \beta}+F_{\beta \alpha}\right) \\ & { }^{*} J_{\alpha \beta \gamma}=J^{\mu} \varepsilon_{\mu \alpha \beta \gamma} \\ & { }^{*} F_{\alpha \beta}=\frac{1}{2} F^{\mu v} \varepsilon_{\mu \nu \alpha \beta} \\ & { }^{*} B_{\alpha}={ }_{6}^{1} B^{\lambda \mu \nu} \varepsilon_{\lambda \mu \nu \alpha} \end{aligned}Sαβγ=S(ωα,eβ,eγ)Sαβγ=S(ωα,ωβ,eγ)S=SαβγeαωβωγS=SαβγeαeβeγSαβγ=ηβμSαμγSαμγ=ημβSαβγMμ=SαμαTαβμν=SαβμMvA2=AαAαηαβηβγ=δαγSαβγ=Nαβ,γRβ=Nαβ,αNαβ,γ=(ηβμNαμ),γ=ηβμNαμ,γNαβγNαβ,μημγ(RαMβ),γ=Rα,γMβ+RαMβ,γGαβ=F[αβ]12(FαβFβα)Hαβ=F(αβ)12(Fαβ+Fβα)Jαβγ=JμεμαβγFαβ=12FμvεμναβBα=61Bλμνελμνα
Computing other components.
Reconstructing the rank-( 1 2 1 2 (1)/(2)\frac{1}{2}12 ) version of S S S\boldsymbol{S}S.
Reconstructing the rank- ( 3 0 ) ( 3 0 ) ((3)/(0))\binom{3}{0}(30) version of S S S\boldsymbol{S}S. [Recall: one does not usually distinguish between the various versions; see equation (3.15) and associated discussion.]
Raising an index.
Lowering an index.
Contraction of S S S\boldsymbol{S}S to form a new tensor M M M\boldsymbol{M}M.
Tensor product of S S S\boldsymbol{S}S with M M M\boldsymbol{M}M to form a new tensor T T T\boldsymbol{T}T.
Squared length of vector A A A\boldsymbol{A}A produced by forming tensor product A A A A A ox A\boldsymbol{A} \otimes \boldsymbol{A}AA and then contracting, which is the same as forming the corresponding 1-form A ~ A ~ widetilde(A)\widetilde{\boldsymbol{A}}A~ and then piercing: A 2 = A ~ , A = A α A α A 2 = A ~ , A = A α A α A^(2)=(: widetilde(A),A:)=A^(alpha)A_(alpha)\boldsymbol{A}^{2}=\langle\widetilde{\boldsymbol{A}}, \boldsymbol{A}\rangle=A^{\alpha} A_{\alpha}A2=A~,A=AαAα.
The matrix formed from the metric's "covariant components," η α β η α β ||eta_(alpha beta)||\left\|\eta_{\alpha \beta}\right\|ηαβ, is the inverse of that formed from its "contravariant components," η α β η α β ||eta^(alpha beta)||\left\|\eta^{\alpha \beta}\right\|ηαβ. Equivalently, raising one index of the metric η α β η α β eta_(alpha beta)\eta_{\alpha \beta}ηαβ produces the Kronecker delta.
Gradient of N N N\boldsymbol{N}N to form a new tensor S S S\boldsymbol{S}S.
Divergence of N N N\boldsymbol{N}N to form a new tensor R R R\boldsymbol{R}R.
Taking gradients and raising or lowering indices are operations that commute.
Contravariant index on a gradient is obtained by raising covariant index.
Gradient of a tensor product; says ( R M ) = ( R M ) = grad(R ox M)=\boldsymbol{\nabla}(\boldsymbol{R} \otimes \boldsymbol{M})=(RM)=
Transpose ( R M ) + R M ( R M ) + R M (grad R ox M)+R ox grad M(\boldsymbol{\nabla} \boldsymbol{R} \otimes \boldsymbol{M})+\boldsymbol{R} \otimes \boldsymbol{\nabla} \boldsymbol{M}(RM)+RM.
Antisymmetrizing a tensor F F F\boldsymbol{F}F to produce a new tensor G G G\boldsymbol{G}G.
Symmetrizing a tensor F F F\boldsymbol{F}F to produce a new tensor H H H\boldsymbol{H}H.
Forming the rank-3 tensor that is dual to a vector (exercise 3.14).
Forming the antisymmetric rank-2 tensor that is dual to a given antisymmetric rank-2 tensor (exercise 3.14).
Forming the 1 -form that is dual to an antisymmetric rank-3 tensor (exercise 3.14).

Exercise 3.11. SYMMETRIES

Let A μ ν A μ ν A_(mu nu)A_{\mu \nu}Aμν be an antisymmetric tensor so that A μ ν = A ν μ A μ ν = A ν μ A_(mu nu)=-A_(nu mu)A_{\mu \nu}=-A_{\nu \mu}Aμν=Aνμ; and let S μ ν S μ ν S^(mu nu)S^{\mu \nu}Sμν be a symmetric tensor so that S μ ν = S ν μ S μ ν = S ν μ S^(mu nu)=S^(nu mu)S^{\mu \nu}=S^{\nu \mu}Sμν=Sνμ.
(a) Justify the equations A μ ν S μ ν = 0 A μ ν S μ ν = 0 A_(mu nu)S^(mu nu)=0A_{\mu \nu} S^{\mu \nu}=0AμνSμν=0 in two ways: first, by writing out the sum explicitly (all sixteen terms) and showing how the terms in the sum cancel in pairs; second, by giving an argument to justify each equals sign in the following string:
A μ ν S μ ν = A ν μ S μ ν = A ν μ S ν μ = A α β S α β = A μ ν S μ ν = 0 A μ ν S μ ν = A ν μ S μ ν = A ν μ S ν μ = A α β S α β = A μ ν S μ ν = 0 A_(mu nu)S^(mu nu)=-A_(nu mu)S^(mu nu)=-A_(nu mu)S^(nu mu)=-A_(alpha beta)S^(alpha beta)=-A_(mu nu)S^(mu nu)=0A_{\mu \nu} S^{\mu \nu}=-A_{\nu \mu} S^{\mu \nu}=-A_{\nu \mu} S^{\nu \mu}=-A_{\alpha \beta} S^{\alpha \beta}=-A_{\mu \nu} S^{\mu \nu}=0AμνSμν=AνμSμν=AνμSνμ=AαβSαβ=AμνSμν=0
(b) Establish the following two identities for any arbitrary tensor V μ ν V μ ν V_(mu nu)V_{\mu \nu}Vμν :
V μ ν A μ ν = 1 2 ( V μ ν V ν μ ) A μ ν , V μ ν S μ ν = 1 2 ( V μ ν + V ν μ ) S μ ν . V μ ν A μ ν = 1 2 V μ ν V ν μ A μ ν , V μ ν S μ ν = 1 2 V μ ν + V ν μ S μ ν . V^(mu nu)A_(mu nu)=(1)/(2)(V^(mu nu)-V^(nu mu))A_(mu nu),quadV^(mu nu)S_(mu nu)=(1)/(2)(V^(mu nu)+V^(nu mu))S_(mu nu).V^{\mu \nu} A_{\mu \nu}=\frac{1}{2}\left(V^{\mu \nu}-V^{\nu \mu}\right) A_{\mu \nu}, \quad V^{\mu \nu} S_{\mu \nu}=\frac{1}{2}\left(V^{\mu \nu}+V^{\nu \mu}\right) S_{\mu \nu} .VμνAμν=12(VμνVνμ)Aμν,VμνSμν=12(Vμν+Vνμ)Sμν.

Exercise 3.12. SYMMETRIZATION AND ANTISYMMETRIZATION

To "symmetrize" a tensor, one averages it with all of its transposes. The components of the new, symmetrized tensor are distinguished by round brackets:
V ( μ ν ) 1 2 ( V μ ν + V ν μ ) (3.46) V ( μ ν λ ) 1 3 ! ( V μ ν λ + V ν λ μ + V λ μ ν + V ν μ λ + V μ λ ν + V λ ν μ ) V ( μ ν ) 1 2 V μ ν + V ν μ (3.46) V ( μ ν λ ) 1 3 ! V μ ν λ + V ν λ μ + V λ μ ν + V ν μ λ + V μ λ ν + V λ ν μ {:[V_((mu nu))-=(1)/(2)(V_(mu nu)+V_(nu mu))],[(3.46)V_((mu nu lambda))-=(1)/(3!)(V_(mu nu lambda)+V_(nu lambda mu)+V_(lambda mu nu)+V_(nu mu lambda)+V_(mu lambda nu)+V_(lambda nu mu))]:}\begin{align*} V_{(\mu \nu)} & \equiv \frac{1}{2}\left(V_{\mu \nu}+V_{\nu \mu}\right) \\ V_{(\mu \nu \lambda)} & \equiv \frac{1}{3!}\left(V_{\mu \nu \lambda}+V_{\nu \lambda \mu}+V_{\lambda \mu \nu}+V_{\nu \mu \lambda}+V_{\mu \lambda \nu}+V_{\lambda \nu \mu}\right) \tag{3.46} \end{align*}V(μν)12(Vμν+Vνμ)(3.46)V(μνλ)13!(Vμνλ+Vνλμ+Vλμν+Vνμλ+Vμλν+Vλνμ)
One "antisymmetrizes" a tensor (square brackets) similarly:
V [ μ ν ] 1 2 ( V μ ν V ν μ ) (3.47) V [ μ ν λ ] 1 3 ! ( V μ ν λ + V ν λ μ + V λ μ ν V ν μ λ V μ λ ν V λ ν ν ) . V [ μ ν ] 1 2 V μ ν V ν μ (3.47) V [ μ ν λ ] 1 3 ! V μ ν λ + V ν λ μ + V λ μ ν V ν μ λ V μ λ ν V λ ν ν . {:[V_([mu nu])-=(1)/(2)(V_(mu nu)-V_(nu mu))],[(3.47)V_([mu nu lambda])-=(1)/(3!)(V_(mu nu lambda)+V_(nu lambda mu)+V_(lambda mu nu)-V_(nu mu lambda)-V_(mu lambda nu)-V_(lambda nu nu)).]:}\begin{align*} V_{[\mu \nu]} & \equiv \frac{1}{2}\left(V_{\mu \nu}-V_{\nu \mu}\right) \\ V_{[\mu \nu \lambda]} & \equiv \frac{1}{3!}\left(V_{\mu \nu \lambda}+V_{\nu \lambda \mu}+V_{\lambda \mu \nu}-V_{\nu \mu \lambda}-V_{\mu \lambda \nu}-V_{\lambda \nu \nu}\right) . \tag{3.47} \end{align*}V[μν]12(VμνVνμ)(3.47)V[μνλ]13!(Vμνλ+Vνλμ+VλμνVνμλVμλνVλνν).
(a) Show that such symmetrized and antisymmetrized tensors are, indeed, symmetric and antisymmetric under interchange of the vectors inserted into their slots:
V ( α β γ ) u α v β w γ = + V ( α β γ v α u β w γ = , V [ α β γ ] u α v β w γ = V [ α β γ ] v α u β w γ = V ( α β γ ) u α v β w γ = + V ( α β γ v α u β w γ = , V [ α β γ ] u α v β w γ = V [ α β γ ] v α u β w γ = {:[V_((alpha beta gamma))u^(alpha)v^(beta)w^(gamma)=+V_((alpha beta gamma)v^(alpha)u^(beta)w^(gamma)=cdots","],[V_([alpha beta gamma])u^(alpha)v^(beta)w^(gamma)=-V_([alpha beta gamma])v^(alpha)u^(beta)w^(gamma)=cdots]:}\begin{aligned} & V_{(\alpha \beta \gamma)} u^{\alpha} v^{\beta} w^{\gamma}=+V_{(\alpha \beta \gamma} v^{\alpha} u^{\beta} w^{\gamma}=\cdots, \\ & V_{[\alpha \beta \gamma]} u^{\alpha} v^{\beta} w^{\gamma}=-V_{[\alpha \beta \gamma]} v^{\alpha} u^{\beta} w^{\gamma}=\cdots \end{aligned}V(αβγ)uαvβwγ=+V(αβγvαuβwγ=,V[αβγ]uαvβwγ=V[αβγ]vαuβwγ=
(b) Show that a second-rank tensor can be reconstructed from its symmetric and antisymmetric parts,
(3.48) V μ ν = V ( μ ν ) + V [ μ ν ] , (3.48) V μ ν = V ( μ ν ) + V [ μ ν ] , {:(3.48)V_(mu nu)=V_((mu nu))+V_([mu nu])",":}\begin{equation*} V_{\mu \nu}=V_{(\mu \nu)}+V_{[\mu \nu]}, \tag{3.48} \end{equation*}(3.48)Vμν=V(μν)+V[μν],
but that a third-rank tensor cannot; V ( α β γ ) V ( α β γ ) V_((alpha beta gamma))V_{(\alpha \beta \gamma)}V(αβγ) and V [ α β γ ] V [ α β γ ] V_([alpha beta gamma])V_{[\alpha \beta \gamma]}V[αβγ] contain together "less information" than V α β γ V α β γ V_(alpha beta gamma)V_{\alpha \beta \gamma}Vαβγ "Young diagrams" (see, e.g., Messiah [1961], appendix D) describe other symmetries, more subtle than these two, which contain the missing information.
(c) Show that the electromagnetic field tensor satisfies
(3.49a) F ( α β ) = 0 , F α β = F [ α β ] . (3.49a) F ( α β ) = 0 , F α β = F [ α β ] . {:(3.49a)F_((alpha beta))=0","quadF_(alpha beta)=F_([alpha beta]).:}\begin{equation*} F_{(\alpha \beta)}=0, \quad F_{\alpha \beta}=F_{[\alpha \beta]} . \tag{3.49a} \end{equation*}(3.49a)F(αβ)=0,Fαβ=F[αβ].
(d) Show that Maxwell's "magnetic" equations
F α β , γ + F β γ , α + F γ α , β = 0 F α β , γ + F β γ , α + F γ α , β = 0 F_(alpha beta,gamma)+F_(beta gamma,alpha)+F_(gamma alpha,beta)=0F_{\alpha \beta, \gamma}+F_{\beta \gamma, \alpha}+F_{\gamma \alpha, \beta}=0Fαβ,γ+Fβγ,α+Fγα,β=0
can be rewritten in the form
(3.49b) F [ α β , γ ] = 0 . (3.49b) F [ α β , γ ] = 0 . {:(3.49b)F_([alpha beta,gamma])=0.:}\begin{equation*} F_{[\alpha \beta, \gamma]}=0 . \tag{3.49b} \end{equation*}(3.49b)F[αβ,γ]=0.

Exercise 3.13. LEVI-CIVITA TENSOR

The "Levi-Civita tensor" ε ε epsi\varepsilonε in spacetime is a fourth-rank, completely antisymmetric tensor:
ε ( n , u , v , w ) ε ( n , u , v , w ) epsi(n,u,v,w)\varepsilon(\boldsymbol{n}, \boldsymbol{u}, \boldsymbol{v}, \boldsymbol{w})ε(n,u,v,w) changes sign when any two of the
vectors are interchanged.
Choose an arbitrary but specific Lorentz frame, with e 0 e 0 e_(0)\boldsymbol{e}_{0}e0 pointing toward the future and with e 1 , e 2 , e 3 e 1 , e 2 , e 3 e_(1),e_(2),e_(3)\boldsymbol{e}_{1}, \boldsymbol{e}_{2}, \boldsymbol{e}_{3}e1,e2,e3 a righthanded set of spatial basis vectors. The covariant components of ε ε epsi\boldsymbol{\varepsilon}ε in this frame are
(3.50b) ε 0123 = ε ( e 0 , e 1 , e 2 , e 3 ) = + 1 (3.50b) ε 0123 = ε e 0 , e 1 , e 2 , e 3 = + 1 {:(3.50b)epsi_(0123)=epsi(e_(0),e_(1),e_(2),e_(3))=+1:}\begin{equation*} \varepsilon_{0123}=\varepsilon\left(\boldsymbol{e}_{0}, \boldsymbol{e}_{1}, \boldsymbol{e}_{2}, \boldsymbol{e}_{3}\right)=+1 \tag{3.50b} \end{equation*}(3.50b)ε0123=ε(e0,e1,e2,e3)=+1
[Note: In an n n nnn-dimensional space, ε ε epsi\boldsymbol{\varepsilon}ε is the analogous completely antisymmetric rank- n n nnn tensor. Its components are
(3.50c) ε 12 n = ε ( e 1 , e 2 , , e n ) = + 1 (3.50c) ε 12 n = ε e 1 , e 2 , , e n = + 1 {:(3.50c)epsi_(12 dots n)=epsi(e_(1),e_(2),dots,e_(n))=+1:}\begin{equation*} \varepsilon_{12 \ldots n}=\boldsymbol{\varepsilon}\left(\boldsymbol{e}_{1}, \boldsymbol{e}_{2}, \ldots, \boldsymbol{e}_{n}\right)=+1 \tag{3.50c} \end{equation*}(3.50c)ε12n=ε(e1,e2,,en)=+1
when computed on a "positively oriented," orthonormal basis e 1 , , e n e 1 , , e n e_(1),dots,e_(n)\boldsymbol{e}_{1}, \ldots, \boldsymbol{e}_{n}e1,,en.]
(a) Use the antisymmetry to show that
(3.50~d) ε α β γ δ = 0 unless α , β , γ , δ are all different, (3.50e) ε π 0 π 1 π 2 π 3 = { + 1 for even permutations of 0 , 1 , 2 , 3 , and 1 for odd permutations. (3.50~d) ε α β γ δ = 0  unless  α , β , γ , δ  are all different,  (3.50e) ε π 0 π 1 π 2 π 3 = + 1  for even permutations of  0 , 1 , 2 , 3 ,  and  1  for odd permutations.  {:[(3.50~d)epsi_(alpha beta gamma delta)=0" unless "alpha","beta","gamma","delta" are all different, "],[(3.50e)epsi_(pi0pi_(1)pi_(2)pi_(3))={[+1" for even permutations of "0","1","2","3","" and "],[-1" for odd permutations. "]:}]:}\begin{align*} \varepsilon_{\alpha \beta \gamma \delta} & =0 \text { unless } \alpha, \beta, \gamma, \delta \text { are all different, } \tag{3.50~d}\\ \varepsilon_{\pi 0 \pi_{1} \pi_{2} \pi_{3}} & =\left\{\begin{array}{l} +1 \text { for even permutations of } 0,1,2,3, \text { and } \\ -1 \text { for odd permutations. } \end{array}\right. \tag{3.50e} \end{align*}(3.50~d)εαβγδ=0 unless α,β,γ,δ are all different, (3.50e)επ0π1π2π3={+1 for even permutations of 0,1,2,3, and 1 for odd permutations. 
(b) Show that
(3.50f) ε π 0 π 1 π 2 π 3 = ε π 0 π 1 π 2 π 3 . (3.50f) ε π 0 π 1 π 2 π 3 = ε π 0 π 1 π 2 π 3 . {:(3.50f)epsi^(pi_(0)pi_(1)pi_(2)pi_(3))=-epsi_(pi_(0)pi_(1)pi_(2)pi_(3)).:}\begin{equation*} \varepsilon^{\pi_{0} \pi_{1} \pi_{2} \pi_{3}}=-\varepsilon_{\pi_{0} \pi_{1} \pi_{2} \pi_{3}} . \tag{3.50f} \end{equation*}(3.50f)επ0π1π2π3=επ0π1π2π3.
(c) By means of a Lorentz transformation show that ε α ¯ β ¯ γ δ ¯ ε α ¯ β ¯ γ δ ¯ epsi^( bar(alpha) bar(beta)gamma bar(delta))\varepsilon^{\bar{\alpha} \bar{\beta} \gamma \bar{\delta}}εα¯β¯γδ¯ and ε α ¯ β ¯ γ ¯ δ ¯ ε α ¯ β ¯ γ ¯ δ ¯ epsi_( bar(alpha) bar(beta) bar(gamma) bar(delta))\varepsilon_{\bar{\alpha} \bar{\beta} \bar{\gamma} \bar{\delta}}εα¯β¯γ¯δ¯ have these same values in any other Lorentz frame with e 0 e 0 ¯ e_( bar(0))\boldsymbol{e}_{\overline{0}}e0 pointing toward the future and with e 1 , e 2 , e 3 e 1 ¯ , e 2 ¯ , e 3 ¯ e_( bar(1)),e_( bar(2)),e_( bar(3))\boldsymbol{e}_{\overline{1}}, \boldsymbol{e}_{\overline{2}}, \boldsymbol{e}_{\overline{3}}e1,e2,e3 a righthanded set. Hint: show that
(3.50~g) ε α β γ δ Λ 0 α Λ 1 Λ 2 γ Λ 3 δ = det | Λ μ ¯ ν | ; (3.50~g) ε α β γ δ Λ 0 ¯ α Λ 1 ¯ Λ 2 ¯ γ Λ 3 ¯ δ = det Λ μ ¯ ν ; {:(3.50~g)epsi^(alpha beta gamma delta)Lambda^( bar(0))_(alpha)Lambda^( bar(1))Lambda^( bar(2))_(gamma)Lambda^( bar(3))_(delta)=-det|Lambda^( bar(mu))_(nu)|;:}\begin{equation*} \varepsilon^{\alpha \beta \gamma \delta} \Lambda^{\overline{0}}{ }_{\alpha} \Lambda^{\overline{1}} \Lambda^{\overline{2}}{ }_{\gamma} \Lambda^{\overline{3}}{ }_{\delta}=-\operatorname{det}\left|\Lambda^{\bar{\mu}}{ }_{\nu}\right| ; \tag{3.50~g} \end{equation*}(3.50~g)εαβγδΛ0αΛ1Λ2γΛ3δ=det|Λμ¯ν|;
from Λ T η Λ = η Λ T η Λ = η Lambda^(T)eta Lambda=eta\Lambda^{T} \eta \Lambda=\etaΛTηΛ=η, show that det | Λ μ ¯ ν | = ± 1 det Λ μ ¯ ν = ± 1 det|Lambda^( bar(mu))_(nu)|=+-1\operatorname{det}\left|\Lambda^{\bar{\mu}}{ }_{\nu}\right|= \pm 1det|Λμ¯ν|=±1; and verify that the determinant is +1 for transformations between frames with e 0 e 0 e_(0)\boldsymbol{e}_{0}e0 and e 0 e 0 e_(0)\boldsymbol{e}_{0}e0 future-pointing, and with e 1 , e 2 , e 3 e 1 , e 2 , e 3 e_(1),e_(2),e_(3)\boldsymbol{e}_{1}, \boldsymbol{e}_{2}, \boldsymbol{e}_{3}e1,e2,e3 and e 1 , e 2 e 1 ¯ , e 2 ¯ e_( bar(1)),e_( bar(2))\boldsymbol{e}_{\overline{1}}, \boldsymbol{e}_{\overline{2}}e1,e2, e 3 e 3 ¯ e_( bar(3))\boldsymbol{e}_{\overline{3}}e3 righthanded.
(d) What are the components of ε ε epsi\boldsymbol{\varepsilon}ε in a Lorentz frame with past-pointing e 0 e 0 ¯ e_( bar(0))\boldsymbol{e}_{\overline{0}}e0 ? with lefthanded e 1 , e 2 , e 3 e 1 ¯ , e 2 ¯ , e 3 ¯ e_( bar(1)),e_( bar(2)),e_( bar(3))\boldsymbol{e}_{\overline{1}}, \boldsymbol{e}_{\overline{2}}, \boldsymbol{e}_{\overline{3}}e1,e2,e3 ?
(e) From the Levi-Civita tensor, one can construct several "permutation tensors." In index notation:
(3.50h) δ α β γ μ ν λ ε α β γ ρ ε μ ν λ ρ (3.50i) δ α β μ ν 1 2 δ α β λ μ ν λ = 1 2 ε α β λ ρ ε μ ν λ ρ (3.50j) δ α μ 1 3 δ α β μ β = 1 6 δ α β λ μ β λ = 1 6 ε α β λ ρ ε μ β λ ρ . (3.50h) δ α β γ μ ν λ ε α β γ ρ ε μ ν λ ρ (3.50i) δ α β μ ν 1 2 δ α β λ μ ν λ = 1 2 ε α β λ ρ ε μ ν λ ρ (3.50j) δ α μ 1 3 δ α β μ β = 1 6 δ α β λ μ β λ = 1 6 ε α β λ ρ ε μ β λ ρ . {:[(3.50h)delta^(alpha beta gamma)_(mu nu lambda)-=-epsi^(alpha beta gamma rho)epsi_(mu nu lambda rho)],[(3.50i)delta^(alpha beta)_(mu nu)-=(1)/(2)delta^(alpha beta lambda)_(mu nu lambda)=-(1)/(2)epsi^(alpha beta lambda rho)epsi_(mu nu lambda rho)],[(3.50j)delta^(alpha)_(mu)-=(1)/(3)delta^(alpha beta)_(mu beta)=(1)/(6)delta^(alpha beta lambda)_(mu beta lambda)=-(1)/(6)epsi^(alpha beta lambda rho)epsi_(mu beta lambda rho).]:}\begin{align*} \delta^{\alpha \beta \gamma}{ }_{\mu \nu \lambda} & \equiv-\varepsilon^{\alpha \beta \gamma \rho} \varepsilon_{\mu \nu \lambda \rho} \tag{3.50h}\\ \delta^{\alpha \beta}{ }_{\mu \nu} & \equiv \frac{1}{2} \delta^{\alpha \beta \lambda}{ }_{\mu \nu \lambda}=-\frac{1}{2} \varepsilon^{\alpha \beta \lambda \rho} \varepsilon_{\mu \nu \lambda \rho} \tag{3.50i}\\ \delta^{\alpha}{ }_{\mu} & \equiv \frac{1}{3} \delta^{\alpha \beta}{ }_{\mu \beta}=\frac{1}{6} \delta^{\alpha \beta \lambda}{ }_{\mu \beta \lambda}=-\frac{1}{6} \varepsilon^{\alpha \beta \lambda \rho} \varepsilon_{\mu \beta \lambda \rho} . \tag{3.50j} \end{align*}(3.50h)δαβγμνλεαβγρεμνλρ(3.50i)δαβμν12δαβλμνλ=12εαβλρεμνλρ(3.50j)δαμ13δαβμβ=16δαβλμβλ=16εαβλρεμβλρ.
Show that:
δ α β γ μ ν λ = { (3.50k) + 1 if α β γ is an even permutation of μ ν λ , 1 if α β γ is an odd permutation of μ ν λ , 0 otherwise; δ α β γ μ ν λ = (3.50k) + 1  if  α β γ  is an even permutation of  μ ν λ , 1  if  α β γ  is an odd permutation of  μ ν λ , 0  otherwise;  delta^(alpha beta gamma)_(mu nu lambda)={[(3.50k)+1" if "alpha beta gamma" is an even permutation of "mu nu lambda","],[-1" if "alpha beta gamma" is an odd permutation of "mu nu lambda","],[0" otherwise; "]:}\delta^{\alpha \beta \gamma}{ }_{\mu \nu \lambda}=\left\{\begin{align*} +1 & \text { if } \alpha \beta \gamma \text { is an even permutation of } \mu \nu \lambda, \tag{3.50k}\\ -1 & \text { if } \alpha \beta \gamma \text { is an odd permutation of } \mu \nu \lambda, \\ 0 & \text { otherwise; } \end{align*}\right.δαβγμνλ={(3.50k)+1 if αβγ is an even permutation of μνλ,1 if αβγ is an odd permutation of μνλ,0 otherwise; 
δ α β μ ν = δ α μ β β ν δ α ν δ β μ (3.501) = { + 1 if α β is an even permutation of μ ν , 1 if α β is an odd permutation of μ ν , 0 otherwise; (3.50~m) δ α μ = { + 1 if α = μ , 0 otherwise. δ α β μ ν = δ α μ β β ν δ α ν δ β μ (3.501) = + 1  if  α β  is an even permutation of  μ ν , 1  if  α β  is an odd permutation of  μ ν , 0  otherwise;  (3.50~m) δ α μ = + 1  if  α = μ , 0  otherwise.  {:[delta^(alpha beta)_(mu nu)=delta^(alpha)_(mu)^(beta)^(beta)_(nu)-delta^(alpha)_(nu)^(delta)^(beta)_(mu)],[(3.501)={[+1" if "alpha beta" is an even permutation of "mu nu","],[-1" if "alpha beta" is an odd permutation of "mu nu","],[0" otherwise; "]:}],[(3.50~m)delta^(alpha)_(mu)={[+1" if "alpha=mu","],[0" otherwise. "]:}]:}\begin{align*} & \delta^{\alpha \beta}{ }_{\mu \nu}=\delta^{\alpha}{ }_{\mu}{ }^{\beta}{ }^{\beta}{ }_{\nu}-\delta^{\alpha}{ }_{\nu}{ }^{\delta}{ }^{\beta}{ }_{\mu} \\ & =\left\{\begin{array}{l} +1 \text { if } \alpha \beta \text { is an even permutation of } \mu \nu, \\ -1 \text { if } \alpha \beta \text { is an odd permutation of } \mu \nu, \\ 0 \text { otherwise; } \end{array}\right. \tag{3.501}\\ & \delta^{\alpha}{ }_{\mu}=\left\{\begin{array}{r} +1 \text { if } \alpha=\mu, \\ 0 \text { otherwise. } \end{array}\right. \tag{3.50~m} \end{align*}δαβμν=δαμββνδανδβμ(3.501)={+1 if αβ is an even permutation of μν,1 if αβ is an odd permutation of μν,0 otherwise; (3.50~m)δαμ={+1 if α=μ,0 otherwise. 

Exercise 3.14. DUALS

From any vector J J J\boldsymbol{J}J, any second-rank antisymmetric tensor F ( F α β = F [ α β ] ) F F α β = F [ α β ] F(F_(alpha beta)=F_([alpha beta]))\boldsymbol{F}\left(F_{\alpha \beta}=F_{[\alpha \beta]}\right)F(Fαβ=F[αβ]), and any third-rank antisymmetric tensor B ( B α β γ = B [ α β γ ] ) B B α β γ = B [ α β γ ] B(B_(alpha beta gamma)=B_([alpha beta gamma]))\boldsymbol{B}\left(B_{\alpha \beta \gamma}=B_{[\alpha \beta \gamma]}\right)B(Bαβγ=B[αβγ]), one can construct new tensors defined by
(3.51) J α β γ = J μ ε μ α β γ , F α β = 1 2 F μ ν v μ ν α β , B α = 1 3 ! B λ μ ν ε λ μ ν α . (3.51) J α β γ = J μ ε μ α β γ , F α β = 1 2 F μ ν v μ ν α β , B α = 1 3 ! B λ μ ν ε λ μ ν α . {:(3.51)^(**)J_(alpha beta gamma)=J^(mu)epsi_(mu alpha beta gamma)","quad^(**)F_(alpha beta)=(1)/(2)F^(mu nu)v_(mu nu alpha beta)","quad^(**)B_(alpha)=(1)/(3!)B^(lambda mu nu)epsi_(lambda mu nu alpha).:}\begin{equation*} { }^{*} J_{\alpha \beta \gamma}=J^{\mu} \varepsilon_{\mu \alpha \beta \gamma}, \quad{ }^{*} F_{\alpha \beta}=\frac{1}{2} F^{\mu \nu} v_{\mu \nu \alpha \beta}, \quad{ }^{*} B_{\alpha}=\frac{1}{3!} B^{\lambda \mu \nu} \varepsilon_{\lambda \mu \nu \alpha} . \tag{3.51} \end{equation*}(3.51)Jαβγ=Jμεμαβγ,Fαβ=12Fμνvμναβ,Bα=13!Bλμνελμνα.
One calls J J J\boldsymbol{J}J the "dual" of J , F J , F J,^(**)F\boldsymbol{J},{ }^{*} \boldsymbol{F}J,F the dual of F F F\boldsymbol{F}F, and B B **B\boldsymbol{*} \boldsymbol{B}B the dual of B B B\boldsymbol{B}B. [A previous and entirely distinct use of the word "dual" ( $ 2.7 $ 2.7 $2.7\$ 2.7$2.7 ) called a set of basis one-forms { ω α } ω α {omega^(alpha)}\left\{\boldsymbol{\omega}^{\alpha}\right\}{ωα} dual to a set of basis vectors { e α } e α {e_(alpha)}\left\{\boldsymbol{e}_{\alpha}\right\}{eα} if ω α , e β = δ α β ω α , e β = δ α β (:omega^(alpha),e_(beta):)=delta^(alpha)_(beta)\left\langle\boldsymbol{\omega}^{\alpha}, \boldsymbol{e}_{\beta}\right\rangle=\delta^{\alpha}{ }_{\beta}ωα,eβ=δαβ. Fortunately there are no grounds for confusion between the two types of duality. One relates sets of vectors to sets of 1 -forms. The other relates antisymmetric tensors of rank p p ppp to antisymmetric tensors of rank 4 p 4 p 4-p4-p4p.]
(a) Show that
(3.52) J = J , F = F , B = B (3.52) J = J , F = F , B = B {:(3.52)^(****)J=J","quad^(****)F=-F","quad^(****)B=B:}\begin{equation*} { }^{* *} \boldsymbol{J}=\boldsymbol{J}, \quad{ }^{* *} \boldsymbol{F}=-\boldsymbol{F}, \quad{ }^{* *} \boldsymbol{B}=\boldsymbol{B} \tag{3.52} \end{equation*}(3.52)J=J,F=F,B=B
so (aside from sign) one can recover any completely antisymmetric tensor H H H\boldsymbol{H}H from its dual H H ^(**)H{ }^{\boldsymbol{*}} \boldsymbol{H}H by taking the dual once again, H H ^(****)H{ }^{* *} \boldsymbol{H}H. This shows that H H H\boldsymbol{H}H and H H ^(**H){ }^{* \boldsymbol{H}}H contain precisely the same information.
(b) Make explicit this fact of same-information-content by writing out the components A α β γ A α β γ ^(**)A^(alpha beta gamma){ }^{*} A^{\alpha \beta \gamma}Aαβγ in terms of A α A α A^(alpha)A^{\alpha}Aα, also F α β F α β ^(**)F^(alpha beta){ }^{*} F^{\alpha \beta}Fαβ in terms of F α β F α β F^(alpha beta)F^{\alpha \beta}Fαβ, also B α B α ^(**)B^(alpha){ }^{*} B^{\alpha}Bα in terms of B α β γ B α β γ B^(alpha beta gamma)B^{\alpha \beta \gamma}Bαβγ.

Exercise 3.15. GEOMETRIC VERSIONS OF MAXWELL EQUATIONS

Show that, if F F F\boldsymbol{F}F is the electromagnetic field tensor, then F = 0 F = 0 grad*^(**)F=0\boldsymbol{\nabla} \cdot{ }^{\boldsymbol{*}} \boldsymbol{F}=0F=0 is a geometric frame-independent version of the Maxwell equations
F α β , γ + F β γ , α + F γ α , β = 0 F α β , γ + F β γ , α + F γ α , β = 0 F_(alpha beta,gamma)+F_(beta gamma,alpha)+F_(gamma alpha,beta)=0F_{\alpha \beta, \gamma}+F_{\beta \gamma, \alpha}+F_{\gamma \alpha, \beta}=0Fαβ,γ+Fβγ,α+Fγα,β=0
Similarly show that F = 4 π J F = 4 π J grad*F=4pi J\boldsymbol{\nabla} \cdot \boldsymbol{F}=4 \pi \boldsymbol{J}F=4πJ (divergence on second slot of F F F\boldsymbol{F}F ) is a geometric version of F α β , β = 4 π J α F α β , β = 4 π J α F^(alpha beta)_(,beta)=4piJ^(alpha)F^{\alpha \beta}{ }_{, \beta}=4 \pi J^{\alpha}Fαβ,β=4πJα.

Exercise 3.16. CHARGE CONSERVATION

From Maxwell's equations F α β , β = 4 π J α F α β , β = 4 π J α F^(alpha beta)_(,beta)=4piJ^(alpha)F^{\alpha \beta}{ }_{, \beta}=4 \pi J^{\alpha}Fαβ,β=4πJα, derive the "equation of charge conservation"
(3.53) J α , α = 0 . (3.53) J α , α = 0 . {:(3.53)J^(alpha)_(,alpha)=0.:}\begin{equation*} J^{\alpha}{ }_{, \alpha}=0 . \tag{3.53} \end{equation*}(3.53)Jα,α=0.
Show that this equation does, indeed, correspond to conservation of charge. It will be studied further in Chapter 5.

Exercise 3.17. VECTOR POTENTIAL

The vector potential A A A\boldsymbol{A}A of electromagnetic theory generates the electromagnetic field tensor via the geometric equation
(3.54) F = ( antisymmetric part of A ) , (3.54) F = (  antisymmetric part of  A ) , {:(3.54)F=-(" antisymmetric part of "grad A)",":}\begin{equation*} \boldsymbol{F}=-(\text { antisymmetric part of } \boldsymbol{\nabla} \boldsymbol{A}), \tag{3.54} \end{equation*}(3.54)F=( antisymmetric part of A),
i.e.,
( ) F μ ν = A ν , μ A μ , ν . ( ) F μ ν = A ν , μ A μ , ν . {:('")"F_(mu nu)=A_(nu,mu)-A_(mu,nu).:}\begin{equation*} F_{\mu \nu}=A_{\nu, \mu}-A_{\mu, \nu} . \tag{$\prime$} \end{equation*}()Fμν=Aν,μAμ,ν.
(a) Show that the electric and magnetic fields in a specific Lorentz frame are given by
(3.55) B = × A , E = A / t A 0 (3.55) B = × A , E = A / t A 0 {:(3.55)B=grad xx A","quad E=-del A//del t-gradA^(0):}\begin{equation*} \boldsymbol{B}=\boldsymbol{\nabla} \times \boldsymbol{A}, \quad \boldsymbol{E}=-\partial \boldsymbol{A} / \partial t-\boldsymbol{\nabla} A^{0} \tag{3.55} \end{equation*}(3.55)B=×A,E=A/tA0
(b) Show that F F F\boldsymbol{F}F will satisfy Maxwell's equations if and only if A A A\boldsymbol{A}A satisfies
(3.56) A , μ α , μ A , μ μ , α = 4 π J α (3.56) A , μ α , μ A , μ μ , α = 4 π J α {:(3.56)A_(,mu)^(alpha,mu)-A_(,mu)^(mu,alpha)=-4piJ^(alpha):}\begin{equation*} A_{, \mu}^{\alpha, \mu}-A_{, \mu}^{\mu, \alpha}=-4 \pi J^{\alpha} \tag{3.56} \end{equation*}(3.56)A,μα,μA,μμ,α=4πJα
(c) Show that "gauge transformations"
(3.57) A NEW = A OLD + d ϕ , ϕ = arbitrary function (3.57) A NEW = A OLD + d ϕ , ϕ =  arbitrary function  {:(3.57)A_(NEW)=A_(OLD)+d phi","quad phi=" arbitrary function ":}\begin{equation*} \boldsymbol{A}_{\mathrm{NEW}}=\boldsymbol{A}_{\mathrm{OLD}}+\boldsymbol{d} \phi, \quad \phi=\text { arbitrary function } \tag{3.57} \end{equation*}(3.57)ANEW=AOLD+dϕ,ϕ= arbitrary function 
leave F F F\boldsymbol{F}F unaffected.
(d) Show that one can adjust the gauge so that
(3.58a) A = 0 ("Lorentz gauge") (3.58b) A = 4 π J . (3.58a) A = 0  ("Lorentz gauge")  (3.58b) A = 4 π J . {:[(3.58a)grad*A=0quad" ("Lorentz gauge") "],[(3.58b)◻A=-4pi J.]:}\begin{align*} \boldsymbol{\nabla} \cdot \boldsymbol{A} & =0 \quad \text { ("Lorentz gauge") } \tag{3.58a}\\ \square \boldsymbol{A} & =-4 \pi \boldsymbol{J} . \tag{3.58b} \end{align*}(3.58a)A=0 ("Lorentz gauge") (3.58b)A=4πJ.
Hereis the wave operator ("d'Alembertian"):
(3.59) A = A , μ α , μ e α (3.59) A = A , μ α , μ e α {:(3.59)◻A=A_(,mu)^(alpha,mu)e_(alpha):}\begin{equation*} \square \boldsymbol{A}=A_{, \mu}^{\alpha, \mu} \boldsymbol{e}_{\alpha} \tag{3.59} \end{equation*}(3.59)A=A,μα,μeα

Exercise 3.18. DIVERGENCE OF ELECTROMAGNETIC STRESS-ENERGY TENSOR

From an electromagnetic field tensor F F F\boldsymbol{F}F, one constructs a second-rank, symmetric tensor T T T\boldsymbol{T}T ("stress-energy tensor," to be studied in Chapter 5) as follows:
(3.60) T μ ν = 1 4 π ( F μ α F α ν 1 4 η μ ν F α β F α β ) (3.60) T μ ν = 1 4 π F μ α F α ν 1 4 η μ ν F α β F α β {:(3.60)T^(mu nu)=(1)/(4pi)(F^(mu alpha)F_(alpha)^(nu)-(1)/(4)eta^(mu nu)F_(alpha beta)F^(alpha beta)):}\begin{equation*} T^{\mu \nu}=\frac{1}{4 \pi}\left(F^{\mu \alpha} F_{\alpha}^{\nu}-\frac{1}{4} \eta^{\mu \nu} F_{\alpha \beta} F^{\alpha \beta}\right) \tag{3.60} \end{equation*}(3.60)Tμν=14π(FμαFαν14ημνFαβFαβ)
As an exercise in index gymnastics:
(a) Show that T T grad*T\boldsymbol{\nabla} \cdot \boldsymbol{T}T has components
(3.61) T μ ν , ν = 1 4 π [ F μ α , ν F v α + F μ α F α , ν ν 1 2 F α β , μ F α β ] . (3.61) T μ ν , ν = 1 4 π F μ α , ν F v α + F μ α F α , ν ν 1 2 F α β , μ F α β . {:(3.61)T^(mu nu)_(,nu)=(1)/(4pi)[F^(mu alpha)_(,nu)F^(v)_(alpha)+F^(mu alpha)F_(alpha,nu)^(nu)-(1)/(2)F_(alpha beta),muF^(alpha beta)].:}\begin{equation*} T^{\mu \nu}{ }_{, \nu}=\frac{1}{4 \pi}\left[F^{\mu \alpha}{ }_{, \nu} F^{v}{ }_{\alpha}+F^{\mu \alpha} F_{\alpha, \nu}^{\nu}-\frac{1}{2} F_{\alpha \beta}, \mu F^{\alpha \beta}\right] . \tag{3.61} \end{equation*}(3.61)Tμν,ν=14π[Fμα,νFvα+FμαFα,νν12Fαβ,μFαβ].
(b) Manipulate this expression into the form
(3.62) T μ ν , ν = 1 4 π [ F μ α F α ν , ν 1 2 F α β ( F α β , μ + F μ α , β + F β μ , α ) ] (3.62) T μ ν , ν = 1 4 π F μ α F α ν , ν 1 2 F α β F α β , μ + F μ α , β + F β μ , α {:(3.62)T_(mu)^(nu)","nu=(1)/(4pi)[-F_(mu alpha)F^(alpha nu)_(,nu)-(1)/(2)F^(alpha beta)(F_(alpha beta,mu)+F_(mu alpha,beta)+F_(beta mu,alpha))]:}\begin{equation*} T_{\mu}{ }^{\nu}, \nu=\frac{1}{4 \pi}\left[-F_{\mu \alpha} F^{\alpha \nu}{ }_{, \nu}-\frac{1}{2} F^{\alpha \beta}\left(F_{\alpha \beta, \mu}+F_{\mu \alpha, \beta}+F_{\beta \mu, \alpha}\right)\right] \tag{3.62} \end{equation*}(3.62)Tμν,ν=14π[FμαFαν,ν12Fαβ(Fαβ,μ+Fμα,β+Fβμ,α)]
note that the first term of (3.62) arises directly from the second term of (3.61).
(c) Use Maxwell's equations to conclude that
(3.63) T μ ν , p = F μ α J α (3.63) T μ ν , p = F μ α J α {:(3.63)T^(mu nu)_(,p)=-F^(mu alpha)J_(alpha):}\begin{equation*} T^{\mu \nu}{ }_{, p}=-F^{\mu \alpha} J_{\alpha} \tag{3.63} \end{equation*}(3.63)Tμν,p=FμαJα

ELECTROMAGNETISM AND DIFFERENTIAL FORMS

The ether trembled at his agitations In a manner so familiar that I only need to say, In accordance with Clerk Maxwell's six equations
It tickled peoples' optics far away. You can feel the way it's done, You may trace them as they run-
d γ d γ dgamma\mathrm{d} \gammadγ by dy less d β d β dbeta\mathrm{d} \betadβ by dz is equal KdX / dt KdX / dt KdX//dt\mathrm{KdX} / \mathrm{dt}KdX/dt
While the curl of ( X , Y , Z ) ( X , Y , Z ) (X,Y,Z)(\mathrm{X}, \mathrm{Y}, \mathrm{Z})(X,Y,Z) is the minus d / dt d / dt d//dt\mathrm{d} / \mathrm{dt}d/dt of the vector ( a , b , c ) ( a , b , c ) (a,b,c)(\mathrm{a}, \mathrm{b}, \mathrm{c})(a,b,c).
From The Revolution of the Corpuscle,
written by A. A. Robb
(to the tune of The Interfering Parrott) for a dinner of the research students of the Cavendish Laboratory in the days of the old mathematics.
This chapter is all Track 2. It is needed as preparation for § § 14.5 § § 14.5 §§14.5\S \S 14.5§§14.5 and 14.6 (computation of curvature using differential forms) and for Chapter 15 (Bianchi identities and boundary of a boundary), but is not needed for the rest of the book.

§4.1. EXTERIOR CALCULUS

Stacks of surfaces, individually or intersecting to make "honeycombs," "egg crates," and other such structures ("differential forms"), give unique insight into the geometry of electromagnetism and gravitation. However, such insight comes at some cost in time. Therefore, most readers should skip this chapter and later material that depends on it during a first reading of this book.
Analytically speaking, differential forms are completely antisymmetric tensors; pictorially speaking, they are intersecting stacks of surfaces. The mathematical formalism for manipulating differential forms with ease, called "exterior calculus," is summarized concisely in Box 4.1 ; its basic features are illustrated in the rest of this chapter by rewriting electromagnetic theory in its language. An effective way to tackle this chapter might be to (1) scan Box 4.1 to get the flavor of the formalism; (2) read the rest of the chapter in detail; (3) restudy Box 4.1 carefully; (4) get practice in manipulating the formalism by working the exercises.*
(continued on page 99)
*Exterior calculus is treated in greater detail than here by: E. Cartan (1945); de Rham (1955); Nickerson, Spencer, and Steenrod (1959); Hauser (1970); Israel (1970); especially Flanders (1963, relatively easy, with many applications); Spivak (1965, sophomore or junior level, but fully in tune with modern mathematics); H. Cartan (1970); and Choquet-Bruhat (1968a).

Box 4.1 DIFFERENTIAL FORMS AND EXTERIOR CALCULUS IN BRIEF

The fundamental definitions and formulas of exterior calculus are summarized here for ready reference. Each item consists of a general statement (at left of page) plus a leading application (at right of page). This formalism is applicable not only to spacetime, but also to more general geometrical systems (see heading of each section). No attempt is made here to demonstrate the internal consistency of the formalism, nor to derive it from any set of definitions and axioms. For a systematic treatment that does so, see, e.g., Spivak (1965), or Misner and Wheeler (1957).

A. Algebra I (applicable to any vector space)

  1. Basis 1-forms.
    a. Coordinate basis ω j = d x j ω j = d x j omega^(j)=dx^(j)\boldsymbol{\omega}^{j}=\boldsymbol{d} x^{j}ωj=dxj
    ( j j jjj tells which 1-form, not which component).
    b. General basis ω j = L k j d x k ω j = L k j d x k omega^(j)=L_(k^('))^(j)dx^(k^('))\boldsymbol{\omega}^{j}=L_{k^{\prime}}^{j} \boldsymbol{d} x^{k^{\prime}}ωj=Lkjdxk.

An application

Simple basis 1 -forms for analyzing Schwarzschild geometry around static spherically symmetric center of attraction:
ω 0 = ( 1 2 m / r ) 1 / 2 d t ; ω 1 = ( 1 2 m / r ) 1 / 2 d r ; ω 2 = r d θ ; ω 3 = r sin θ d ϕ . ω 0 = ( 1 2 m / r ) 1 / 2 d t ; ω 1 = ( 1 2 m / r ) 1 / 2 d r ; ω 2 = r d θ ; ω 3 = r sin θ d ϕ . {:[omega^(0)=(1-2m//r)^(1//2)dt;],[omega^(1)=(1-2m//r)^(-1//2)dr;],[omega^(2)=rd theta;],[omega^(3)=r sin theta d phi.]:}\begin{aligned} & \boldsymbol{\omega}^{0}=(1-2 m / r)^{1 / 2} \boldsymbol{d} t ; \\ & \boldsymbol{\omega}^{1}=(1-2 m / r)^{-1 / 2} \boldsymbol{d} r ; \\ & \boldsymbol{\omega}^{2}=r \boldsymbol{d} \theta ; \\ & \boldsymbol{\omega}^{3}=r \sin \theta \boldsymbol{d} \phi . \end{aligned}ω0=(12m/r)1/2dt;ω1=(12m/r)1/2dr;ω2=rdθ;ω3=rsinθdϕ.
  1. General p-form (or p-vector) is a completely antisymmetric tensor of rank ( 0 p ) [ ( 0 p ) ((0)/(p))[:}\binom{0}{p}\left[\right.(0p)[ or ( p 0 ) ] ( p 0 ) {:((p)/(0))]\left.\binom{p}{0}\right](p0)]. It can be expanded in terms of wedge products (see § 3.5 § 3.5 §3.5\S 3.5§3.5 and exercise 4.12):
α = 1 p ! α i 1 i 2 i p ω i 1 ω i 2 ω i p α i 1 i 2 i p ω i 1 ω i 2 ω i p . α = 1 p ! α i 1 i 2 i p ω i 1 ω i 2 ω i p α i 1 i 2 i p ω i 1 ω i 2 ω i p . {:[alpha=(1)/(p!)alpha_(i_(1)i_(2)dotsi_(p))omega^(i_(1))^^omega^(i_(2))^^cdots^^omega^(i_(p))],[-=alpha_(∣i_(1)i_(2)dotsi_(p))omega^(i_(1))^^omega^(i_(2))^^cdots^^omega^(i_(p)).]:}\begin{aligned} \boldsymbol{\alpha} & =\frac{1}{p!} \alpha_{i_{1} i_{2} \ldots i_{p}} \boldsymbol{\omega}^{i_{1}} \wedge \boldsymbol{\omega}^{i_{2}} \wedge \cdots \wedge \boldsymbol{\omega}^{i_{p}} \\ & \equiv \alpha_{\mid i_{1} i_{2} \ldots i_{p}} \boldsymbol{\omega}^{i_{1}} \wedge \boldsymbol{\omega}^{i_{2}} \wedge \cdots \wedge \boldsymbol{\omega}^{i_{p}} . \end{aligned}α=1p!αi1i2ipωi1ωi2ωipαi1i2ipωi1ωi2ωip.
(Note: Vertical bars around the indices mean summation extends only over i 1 < i 2 < < i p i 1 < i 2 < < i p i_(1) < i_(2) < cdots < i_(p)i_{1}<i_{2}<\cdots<i_{p}i1<i2<<ip.)

Two applications

Energy-momentum 1-form is of type α = α i ω i α = α i ω i alpha=alpha_(i)omega^(i)\boldsymbol{\alpha}=\alpha_{i} \boldsymbol{\omega}^{i}α=αiωi or
p = E d t + p x d x + p y d y + p z d z p = E d t + p x d x + p y d y + p z d z p=-Edt+p_(x)dx+p_(y)dy+p_(z)dz\boldsymbol{p}=-E \boldsymbol{d} t+p_{x} \boldsymbol{d} x+p_{y} \boldsymbol{d} y+p_{z} \boldsymbol{d} zp=Edt+pxdx+pydy+pzdz
Faraday is a 2 -form of type β = β | μ ν | ω μ ω ν β = β | μ ν | ω μ ω ν beta=beta_(|mu nu|)omega^(mu)^^omega^(nu)\boldsymbol{\beta}=\beta_{|\mu \nu|} \boldsymbol{\omega}^{\mu} \wedge \boldsymbol{\omega}^{\nu}β=β|μν|ωμων or in flat spacetime
F = E x d t d x E y d t d y E z d t d z + B x d y d z + B y d z d x + B z d x d y F = E x d t d x E y d t d y E z d t d z + B x d y d z + B y d z d x + B z d x d y {:[F=-E_(x)dt^^dx-E_(y)dt^^dy-E_(z)dt^^dz],[+B_(x)dy^^dz+B_(y)dz^^dx+B_(z)dx^^dy]:}\begin{aligned} \boldsymbol{F}= & -E_{x} \boldsymbol{d} t \wedge \boldsymbol{d} x-E_{y} \boldsymbol{d} t \wedge \boldsymbol{d} y-E_{z} \boldsymbol{d} t \wedge \boldsymbol{d} z \\ & +B_{x} \boldsymbol{d} y \wedge \boldsymbol{d} z+B_{y} \boldsymbol{d} z \wedge \boldsymbol{d} x+B_{z} \boldsymbol{d} x \wedge \boldsymbol{d} y \end{aligned}F=ExdtdxEydtdyEzdtdz+Bxdydz+Bydzdx+Bzdxdy
Box 4.1 (continued)
3. Wedge product.
All familiar rules of addition and multiplication hold, such as
( a α + b β ) γ = a α γ + b β γ , ( α β ) γ = α ( β γ ) α β γ , ( a α + b β ) γ = a α γ + b β γ , ( α β ) γ = α ( β γ ) α β γ , {:[(a alpha+b beta)^^gamma=a alpha^^gamma+b beta^^gamma","],[(alpha^^beta)^^gamma=alpha^^(beta^^gamma)-=alpha^^beta^^gamma","]:}\begin{aligned} & (a \boldsymbol{\alpha}+b \boldsymbol{\beta}) \wedge \boldsymbol{\gamma}=a \boldsymbol{\alpha} \wedge \boldsymbol{\gamma}+b \boldsymbol{\beta} \wedge \boldsymbol{\gamma}, \\ & (\alpha \wedge \beta) \wedge \gamma=\alpha \wedge(\beta \wedge \gamma) \equiv \alpha \wedge \beta \wedge \gamma, \end{aligned}(aα+bβ)γ=aαγ+bβγ,(αβ)γ=α(βγ)αβγ,
except for a modified commutation law between
a p p ppp-form α α alpha\boldsymbol{\alpha}α and a q q qqq-form β β beta\boldsymbol{\beta}β :
α p β q = ( 1 ) p q β q α p . α p β q = ( 1 ) p q β q α p . alpha _(p)^^beta _(q)=(-1)^(pq)beta _(q)^^alpha _(p).\underset{p}{\boldsymbol{\alpha}} \wedge \underset{q}{\boldsymbol{\beta}}=(-1)^{p q} \underset{q}{\boldsymbol{\beta}} \wedge \underset{p}{\boldsymbol{\alpha}} .αpβq=(1)pqβqαp.
Applications to 1-forms α , β α , β alpha,beta\boldsymbol{\alpha}, \boldsymbol{\beta}α,β :
α β = β a , α α = 0 α β = ( α j ω j ) ( β k ω k ) = α j β k ω j ω k = 1 2 ( α j β k β j α k ) ω j ω k . α β = β a , α α = 0 α β = α j ω j β k ω k = α j β k ω j ω k = 1 2 α j β k β j α k ω j ω k . {:[alpha^^beta=-beta^^a","quad alpha^^alpha=0],[alpha^^beta=(alpha_(j)omega^(j))^^(beta_(k)omega^(k))=alpha_(j)beta_(k)omega^(j)^^omega^(k)],[=(1)/(2)(alpha_(j)beta_(k)-beta_(j)alpha_(k))omega^(j)^^omega^(k).]:}\begin{aligned} \boldsymbol{\alpha} \wedge \boldsymbol{\beta} & =-\boldsymbol{\beta} \wedge \boldsymbol{a}, \quad \boldsymbol{\alpha} \wedge \boldsymbol{\alpha}=0 \\ \boldsymbol{\alpha} \wedge \boldsymbol{\beta} & =\left(\alpha_{j} \boldsymbol{\omega}^{j}\right) \wedge\left(\beta_{k} \boldsymbol{\omega}^{k}\right)=\alpha_{j} \beta_{k} \boldsymbol{\omega}^{j} \wedge \boldsymbol{\omega}^{k} \\ & =\frac{1}{2}\left(\alpha_{j} \beta_{k}-\beta_{j} \alpha_{k}\right) \boldsymbol{\omega}^{j} \wedge \boldsymbol{\omega}^{k} . \end{aligned}αβ=βa,αα=0αβ=(αjωj)(βkωk)=αjβkωjωk=12(αjβkβjαk)ωjωk.
  1. Contraction of p -form on p -vector.
    α p , A p α p , A p (:alpha _(p),A_(p):)\langle\underset{p}{\alpha}, \underset{p}{\boldsymbol{A}}\rangleαp,Ap
= α | i 1 i p | A | j 1 j p | δ j 1 i p i p , i p ( see exercises 3.13 and 4.12) ] [ ω i 1 ω i p , e j 1 e j p = α | i 1 i p | | i 1 i p . = α i 1 i p A j 1 j p δ j 1 i p i p , i p (  see exercises  3.13  and 4.12) ] ω i 1 ω i p , e j 1 e j p = α i 1 i p i 1 i p . {:[=alpha_(|i_(1)dotsi_(p)|)A^(|j_(1)dotsj_(p)|)(:ubrace(-=delta_(j_(1)dotsi_(p))^(i_(p),i_(p))(" see exercises "3.13" and 4.12)"]ubrace)_([omega^(i_(1))^^cdots^^omega^(i_(p)),e_(j_(1))^^cdots^^e_(j_(p)):))],[=alpha_(|i_(1)dotsi_(p)|)|^(i_(1)dotsi_(p).)]:}\begin{aligned} & =\alpha_{\left|i_{1} \ldots i_{p}\right|} A^{\left|j_{1} \ldots j_{p}\right|}\langle\underbrace{\equiv \delta_{j_{1} \ldots i_{p}}^{i_{p}, i_{p}}(\text { see exercises } 3.13 \text { and 4.12)}]}_{\left[\boldsymbol{\omega}^{i_{1}} \wedge \cdots \wedge \boldsymbol{\omega}^{i_{p}}, \boldsymbol{e}_{j_{1}} \wedge \cdots \wedge \boldsymbol{e}_{j_{p}}\right\rangle} \\ & =\left.\alpha_{\left|i_{1} \ldots i_{p}\right|}\right|^{i_{1} \ldots i_{p} .} \end{aligned}=α|i1ip|A|j1jp|δj1ipip,ip( see exercises 3.13 and 4.12)][ωi1ωip,ej1ejp=α|i1ip||i1ip.
Four applications
a. Contraction of a particle's energy-momentum 1-form p = p α ω α p = p α ω α p=p_(alpha)omega^(alpha)\boldsymbol{p}=p_{\alpha} \boldsymbol{\omega}^{\alpha}p=pαωα with 4-velocity u = u α e α u = u α e α u=u^(alpha)e_(alpha)\boldsymbol{u}=u^{\alpha} \boldsymbol{e}_{\alpha}u=uαeα of observer (a 1-vector):
p , u = p α u α = energy of particle. p , u = p α u α =  energy of particle.  -(:p,u:)=-p_(alpha)u^(alpha)=" energy of particle. "-\langle\boldsymbol{p}, \boldsymbol{u}\rangle=-p_{\alpha} u^{\alpha}=\text { energy of particle. }p,u=pαuα= energy of particle. 
b. Contraction of Faraday 2 -form F F F\boldsymbol{F}F with bivector δ P Δ P δ P Δ P deltaP^^DeltaP\delta \mathscr{P} \wedge \Delta \mathscr{P}δPΔP [where δ P = ( d P / d λ 1 ) Δ λ 1 δ P = d P / d λ 1 Δ λ 1 deltaP=(dP//dlambda_(1))Deltalambda_(1)\delta \mathscr{P}=\left(d \mathscr{P} / d \lambda_{1}\right) \Delta \lambda_{1}δP=(dP/dλ1)Δλ1 and Δ P = Δ P = DeltaP=\Delta \mathscr{P}=ΔP= ( d P / d X 2 ) Δ λ 2 d P / d X 2 Δ λ 2 (dP//dX_(2))Deltalambda_(2)\left(d \mathscr{P} / d \mathrm{X}_{2}\right) \Delta \lambda_{2}(dP/dX2)Δλ2 are two infinitesimal vectors in a 2 -surface P ( λ 1 , λ 2 ) P λ 1 , λ 2 P(lambda_(1),lambda_(2))\mathscr{P}\left(\lambda_{1}, \lambda_{2}\right)P(λ1,λ2), and the bivector represents the surface element they span] is the magnetic flux Φ = F , δ P Φ = F , δ P Phi=(:F,deltaP\Phi=\langle\boldsymbol{F}, \delta \mathscr{P}Φ=F,δP Δ P Δ P ^^DeltaP:)\wedge \Delta \mathscr{P}\rangleΔP through that surface element.
c. More generally, a p p ppp-dimensional parallelepiped with vectors a 1 , a 2 , , a p a 1 , a 2 , , a p a_(1),a_(2),dots,a_(p)\boldsymbol{a}_{1}, \boldsymbol{a}_{2}, \ldots, \boldsymbol{a}_{p}a1,a2,,ap for legs has an oriented volume described by the "simple" p p ppp-vector a 1 a 2 a p a 1 a 2 a p a_(1)^^a_(2)^^cdotsa_(p)\boldsymbol{a}_{1} \wedge \boldsymbol{a}_{2} \wedge \cdots \boldsymbol{a}_{p}a1a2ap (oriented because interchange of two legs changes its sign). An egg-crate type of structure with walls made from the hyperplanes of p p ppp different 1 -forms σ 1 σ 1 sigma^(1)\sigma^{1}σ1,
σ 2 , , σ p σ 2 , , σ p sigma^(2),dots,sigma^(p)\boldsymbol{\sigma}^{2}, \ldots, \boldsymbol{\sigma}^{p}σ2,,σp is described by the "simple" p p ppp-form σ 1 σ 1 sigma^(1)\boldsymbol{\sigma}^{1}σ1 σ 2 σ p σ 2 σ p ^^sigma^(2)^^cdots^^sigma^(p)\wedge \boldsymbol{\sigma}^{2} \wedge \cdots \wedge \boldsymbol{\sigma}^{p}σ2σp. The number of cells of σ 1 σ 1 sigma^(1)^^\boldsymbol{\sigma}^{1} \wedgeσ1 σ 2 σ p σ 2 σ p sigma^(2)^^cdots^^sigma^(p)\sigma^{2} \wedge \cdots \wedge \sigma^{p}σ2σp sliced through by the infinitesimal p p ppp-volume a 1 a 2 a p a 1 a 2 a p a_(1)^^a_(2)^^cdots^^a_(p)\boldsymbol{a}_{1} \wedge \boldsymbol{a}_{2} \wedge \cdots \wedge \boldsymbol{a}_{p}a1a2ap is
σ 1 σ 2 σ p , a 1 a 2 a p σ 1 σ 2 σ p , a 1 a 2 a p (:sigma^(1)^^sigma^(2)^^cdots^^sigma^(p),a_(1)^^a_(2)^^cdots^^a_(p):)\left\langle\sigma^{1} \wedge \sigma^{2} \wedge \cdots \wedge \boldsymbol{\sigma}^{p}, a_{1} \wedge a_{2} \wedge \cdots \wedge a_{p}\right\rangleσ1σ2σp,a1a2ap
d. The Jacobian determinant of a set of p p ppp functions f k ( x 1 , , x n ) f k x 1 , , x n f^(k)(x^(1),dots,x^(n))f^{k}\left(x^{1}, \ldots, x^{n}\right)fk(x1,,xn) with respect to p p ppp of their arguments is
d f 1 d f 2 d f p , P x 1 P x 2 P x p = det ( f k x j ) ( f 1 , f 2 , , f p ) ( x 1 , x 2 , , x p ) . d f 1 d f 2 d f p , P x 1 P x 2 P x p = det f k x j f 1 , f 2 , , f p x 1 , x 2 , , x p . {:[(:df^(1)^^df^(2)^^dots:}{:^^df^(p),(delP)/(delx^(1))^^(delP)/(delx^(2))^^dots^^(delP)/(delx^(p)):)],[=det||((delf^(k))/(delx^(j)))||-=(del(f^(1),f^(2),dots,f^(p)))/(del(x^(1),x^(2),dots,x^(p))).]:}\begin{aligned} \left\langle\boldsymbol{d} f^{1} \wedge \boldsymbol{d} f^{2} \wedge \ldots\right. & \left.\wedge \boldsymbol{d} f^{p}, \frac{\partial \mathscr{P}}{\partial x^{1}} \wedge \frac{\partial \mathscr{P}}{\partial x^{2}} \wedge \ldots \wedge \frac{\partial \mathscr{P}}{\partial x^{p}}\right\rangle \\ & =\operatorname{det}\left\|\left(\frac{\partial f^{k}}{\partial x^{j}}\right)\right\| \equiv \frac{\partial\left(f^{1}, f^{2}, \ldots, f^{p}\right)}{\partial\left(x^{1}, x^{2}, \ldots, x^{p}\right)} . \end{aligned}df1df2dfp,Px1Px2Pxp=det(fkxj)(f1,f2,,fp)(x1,x2,,xp).
  1. Simple forms.
    a. A simple p p ppp-form is one that can be written as a wedge product of p 1 p 1 p1p 1p1-forms:
σ p = α β γ p factors σ p = α β γ p  factors  sigma _(p)=ubrace(alpha^^beta^^cdots^^gammaubrace)_(p" factors ")\underset{p}{\boldsymbol{\sigma}}=\underbrace{\boldsymbol{\alpha} \wedge \boldsymbol{\beta} \wedge \cdots \wedge \boldsymbol{\gamma}}_{p \text { factors }}σp=αβγp factors 
b. A simple p p ppp-form α β γ α β γ alpha^^beta^^cdots^^gamma\alpha \wedge \beta \wedge \cdots \wedge \boldsymbol{\gamma}αβγ is represented by the intersecting families of surfaces of α , β , , γ α , β , , γ alpha,beta,dots,gamma\boldsymbol{\alpha}, \boldsymbol{\beta}, \ldots, \boldsymbol{\gamma}α,β,,γ (egg-crate structure) plus a sense of circulation (orientation).

Applications:

a. In four dimensions (e.g., spacetime) all 0 -forms, 1 forms, 3 -forms, and 4 -forms are simple. A 2 -form F F F\boldsymbol{F}F is generally a sum of two simple forms, e.g., F = F = F=\boldsymbol{F}=F= e d t d x + h d y d z e d t d x + h d y d z -edt^^dx+hdy^^dz-e \boldsymbol{d} t \wedge \boldsymbol{d} x+h \boldsymbol{d} y \wedge \boldsymbol{d} zedtdx+hdydz; it is simple if and only if F F = 0 F F = 0 F^^F=0\boldsymbol{F} \wedge \boldsymbol{F}=0FF=0.
b. A set of 1 -forms α , β , , γ α , β , , γ alpha,beta,dots,gamma\boldsymbol{\alpha}, \boldsymbol{\beta}, \ldots, \boldsymbol{\gamma}α,β,,γ is linearly dependent (one a linear combination of the others) if and only if
α β γ = 0 α β γ = 0 alpha^^beta^^cdots^^gamma=0quad\alpha \wedge \beta \wedge \cdots \wedge \boldsymbol{\gamma}=0 \quadαβγ=0 (egg crate collapsed).
B. Exterior Derivative (applicable to any "differentiable manifold," with or without metric)
  1. d d d\boldsymbol{d}d produces a ( p + 1 ) ( p + 1 ) (p+1)(p+1)(p+1)-form d σ d σ d sigma\boldsymbol{d} \boldsymbol{\sigma}dσ from a p p ppp-form σ σ sigma\boldsymbol{\sigma}σ.
  2. Effect of d d d\boldsymbol{d}d is defined by induction using the

Box 4.1 (continued)

(Chapter 2) definition of d f d f df\boldsymbol{d} fdf, and f f fff a function ( 0 0 0-0-0 form), plus
d ( α p β q β = d α β + ( 1 ) p α d β d 2 = d d = 0 d α p β q β = d α β + ( 1 ) p α d β d 2 = d d = 0 {:[d(alpha _(p)^^beta _(q)^(beta):}=d alpha^^beta+(-1)^(p)alpha^^d beta],[d^(2)=dd=0]:}\begin{aligned} \boldsymbol{d}\left(\underset{p}{\alpha} \wedge{\underset{q}{\beta}}^{\boldsymbol{\beta}}\right. & =\boldsymbol{d} \boldsymbol{\alpha} \wedge \boldsymbol{\beta}+(-1)^{p} \boldsymbol{\alpha} \wedge \boldsymbol{d} \boldsymbol{\beta} \\ \boldsymbol{d}^{2} & =\boldsymbol{d} \boldsymbol{d}=0 \end{aligned}d(αpβqβ=dαβ+(1)pαdβd2=dd=0

Two applications

d ( α d β ) = d α d β d ( α d β ) = d α d β d(alpha^^d beta)=d alpha^^d betad(\alpha \wedge d \beta)=d \alpha \wedge d \betad(αdβ)=dαdβ
For the p p ppp-form ϕ ϕ phi\boldsymbol{\phi}ϕ, with
ϕ = ϕ | i 1 i i | d x i 1 d x i p ϕ = ϕ i 1 i i d x i 1 d x i p phi=phi_(|i_(1)dotsi_(i)|)dx^(i_(1))^^cdots^^dx^(i_(p))\boldsymbol{\phi}=\phi_{\left|i_{1} \ldots i_{i}\right|} \boldsymbol{d} x^{i_{1}} \wedge \cdots \wedge \boldsymbol{d} x^{i_{p}}ϕ=ϕ|i1ii|dxi1dxip
one has (alternative and equivalent definition of d ϕ d ϕ d phi\boldsymbol{d} \boldsymbol{\phi}dϕ )
d ϕ = d ϕ | i 1 i p | d x i 1 d x i p d ϕ = d ϕ i 1 i p d x i 1 d x i p d phi=d_(phi_(|i_(1)dotsi_(p)|))^^dx^(i_(1))^^cdots^^dx^(i_(p))\boldsymbol{d} \boldsymbol{\phi}=\boldsymbol{d}_{\phi_{\left|i_{1} \ldots i_{p}\right|}} \wedge \boldsymbol{d} x^{i_{1}} \wedge \cdots \wedge \boldsymbol{d} x^{i_{p}}dϕ=dϕ|i1ip|dxi1dxip
C. Integration (applicable to any "differentiable manifold," with or without metric)
  1. Pictorial interpretation.
Text and pictures of Chapter 4 interpret α α int alpha\int \boldsymbol{\alpha}α (integral of specified 1-form α α alpha\boldsymbol{\alpha}α along specified curve from specified starting point to specified end point) as "number of α α alpha\boldsymbol{\alpha}α-surfaces pierced on that route"; similarly, they interpret ϕ ϕ int phi\int \boldsymbol{\phi}ϕ (integral of specified 2 -form ϕ ϕ phi\boldsymbol{\phi}ϕ over specified bit of surface on which there is an assigned sense of circulation or "orientation") as "number of cells of the honeycomb-like structure ϕ ϕ phi\boldsymbol{\phi}ϕ cut through by that surface"; similarly for the egg-crate-like structures that represent 3forms; etc.
2. Computational rules for integration.
To evaluate α α int alpha\int \boldsymbol{\alpha}α, the integral of a p p ppp-form
α = α | i 1 i p | ( x 1 , , x n ) d x i 1 d x i p α = α i 1 i p x 1 , , x n d x i 1 d x i p alpha=alpha_(|i_(1)dotsi_(p)|)(x^(1),dots,x^(n))dx^(i_(1))^^cdots^^dx^(i_(p))\boldsymbol{\alpha}=\alpha_{\left|i_{1} \ldots i_{p}\right|}\left(x^{1}, \ldots, x^{n}\right) \boldsymbol{d} x^{i_{1}} \wedge \cdots \wedge \boldsymbol{d} x^{i_{p}}α=α|i1ip|(x1,,xn)dxi1dxip
over a p p ppp-dimensional surface, proceed in two steps.
a. Substitute a parameterization of the surface,
x k ( λ 1 , , λ p ) x k λ 1 , , λ p x^(k)(lambda^(1),dots,lambda^(p))x^{k}\left(\lambda^{1}, \ldots, \lambda^{p}\right)xk(λ1,,λp)
into α α alpha\boldsymbol{\alpha}α, and collect terms in the form
a = a ( λ j ) d λ 1 d λ p a = a λ j d λ 1 d λ p a=a(lambda^(j))dlambda^(1)^^cdots^^dlambda^(p)\boldsymbol{a}=a\left(\lambda^{j}\right) \boldsymbol{d} \lambda^{1} \wedge \cdots \wedge \boldsymbol{d} \lambda^{p}a=a(λj)dλ1dλp
(this is α α alpha\boldsymbol{\alpha}α viewed as a p p ppp-form in the p p ppp-dimensional surface);
b. Integrate
a = a ( λ j ) d λ 1 d λ 2 d λ p a = a λ j d λ 1 d λ 2 d λ p int a=int a(lambda^(j))dlambda^(1)dlambda^(2)dots dlambda^(p)\int \boldsymbol{a}=\int a\left(\lambda^{j}\right) d \lambda^{1} d \lambda^{2} \ldots d \lambda^{p}a=a(λj)dλ1dλ2dλp
using elementary definition of integration.
Example: See equations (4.12) to (4.14).
3. The differential geometry of integration.
Calculate α α int alpha\int \boldsymbol{\alpha}α for a p p ppp-form α α alpha\boldsymbol{\alpha}α as follows.
a. Choose the p p ppp-dimensional surface S S SSS over which to integrate.
b. Represent S S SSS by a parametrization giving the generic point of the surface as a function of the parameters, P ( λ 1 , λ 2 , λ p ) P λ 1 , λ 2 , λ p P(lambda^(1),lambda^(2),dotslambda^(p))\mathscr{P}\left(\lambda^{1}, \lambda^{2}, \ldots \lambda^{p}\right)P(λ1,λ2,λp). This fixes the orientation. The same function with λ 1 λ 2 λ 1 λ 2 lambda^(1)harrlambda^(2)\lambda^{1} \leftrightarrow \lambda^{2}λ1λ2, P ( λ 2 , λ 1 , , λ p ) P λ 2 , λ 1 , , λ p P(lambda^(2),lambda^(1),dots,lambda^(p))\mathscr{P}\left(\lambda^{2}, \lambda^{1}, \ldots, \lambda^{p}\right)P(λ2,λ1,,λp), describes a different (i.e., oppositely oriented) surface, S S -S-SS.
c. The infinitesimal parallelepiped
( P λ 1 Δ λ 1 ) ( P λ 2 Δ λ 2 ) ( P λ p Δ λ p ) P λ 1 Δ λ 1 P λ 2 Δ λ 2 P λ p Δ λ p ((delP)/(dellambda^(1))Deltalambda^(1))^^((delP)/(dellambda^(2))Deltalambda^(2))^^cdots^^((delP)/(dellambda^(p))Deltalambda^(p))\left(\frac{\partial \mathscr{P}}{\partial \lambda^{1}} \Delta \lambda^{1}\right) \wedge\left(\frac{\partial \mathscr{P}}{\partial \lambda^{2}} \Delta \lambda^{2}\right) \wedge \cdots \wedge\left(\frac{\partial \mathscr{P}}{\partial \lambda^{p}} \Delta \lambda^{p}\right)(Pλ1Δλ1)(Pλ2Δλ2)(PλpΔλp)
is tangent to the surface. The number of cells of a a a\boldsymbol{a}a it slices is
α , P λ 1 P λ p Δ λ 1 Δ λ p α , P λ 1 P λ p Δ λ 1 Δ λ p (:alpha,(delP)/(dellambda^(1))^^dots^^(delP)/(dellambda^(p)):)Deltalambda^(1)dots Deltalambda^(p)\left\langle\boldsymbol{\alpha}, \frac{\partial \mathscr{P}}{\partial \lambda^{1}} \wedge \ldots \wedge \frac{\partial \mathscr{P}}{\partial \lambda^{p}}\right\rangle \Delta \lambda^{1} \ldots \Delta \lambda^{p}α,Pλ1PλpΔλ1Δλp
This number changes sign if two of the vectors P / λ k P / λ k delP//dellambda^(k)\partial \mathscr{P} / \partial \lambda^{k}P/λk are interchanged, as for an oppositely oriented surface.
d. The above provides an interpretation motivating the definition
α α , P λ 1 P λ 2 P λ p d λ 1 d λ 2 d λ p α α , P λ 1 P λ 2 P λ p d λ 1 d λ 2 d λ p {:[int alpha-=∬cdots int(:alpha,(delP)/(dellambda^(1))^^(delP)/(dellambda^(2))^^cdots^^(delP)/(dellambda^(p)):)],[dlambda^(1)dlambda^(2)dots dlambda^(p)]:}\begin{array}{r} \int \boldsymbol{\alpha} \equiv \iint \cdots \int\left\langle\boldsymbol{\alpha}, \frac{\partial \mathscr{P}}{\partial \lambda^{1}} \wedge \frac{\partial \mathscr{P}}{\partial \lambda^{2}} \wedge \cdots \wedge \frac{\partial \mathscr{P}}{\partial \lambda^{p}}\right\rangle \\ d \lambda^{1} d \lambda^{2} \ldots d \lambda^{p} \end{array}αα,Pλ1Pλ2Pλpdλ1dλ2dλp
This definition is identified with the computational rule of the preceding section (C.2) in exercise 4.9.

An application

Integrate a gradient d f d f df\boldsymbol{d} fdf along a curve, P ( λ ) P ( λ ) P(lambda)\mathscr{P}(\lambda)P(λ) from P ( 0 ) P ( 0 ) P(0)\mathscr{P}(0)P(0) to P ( 1 ) P ( 1 ) P(1)\mathscr{P}(1)P(1) :
d f = 0 1 d f , d P / d λ d λ = 0 1 ( d f / d λ ) d λ = f [ P ( 1 ) ] f [ P ( 0 ) ] d f = 0 1 d f , d P / d λ d λ = 0 1 ( d f / d λ ) d λ = f [ P ( 1 ) ] f [ P ( 0 ) ] {:[int df=int_(0)^(1)(:df","dP//d lambda:)d lambda=int_(0)^(1)(df//d lambda)d lambda],[=f[P(1)]-f[P(0)]]:}\begin{aligned} \int \boldsymbol{d} f & =\int_{0}^{1}\langle\boldsymbol{d} f, d \mathscr{P} / d \lambda\rangle d \lambda=\int_{0}^{1}(d f / d \lambda) d \lambda \\ & =f[\mathscr{P}(1)]-f[\mathscr{P}(0)] \end{aligned}df=01df,dP/dλdλ=01(df/dλ)dλ=f[P(1)]f[P(0)]
e. Three different uses for symbol "d": First, lightface d d ddd in explicit derivative expressions such as

Box 4.1 (continued)

d / d a d / d a d//dad / d ad/da, or d f / d a d f / d a df//dad f / d adf/da, or d P / d a d P / d a dP//dad \mathscr{P} / d adP/da; neither numerator nor denominator alone has any meaning, but only the full string of symbols. Second, lightface d d ddd inside an integral sign; e.g., f d a f d a int fda\int f d afda. This is an instruction to perform integration, and has no meaning whatsoever without an integral sign; " d " d "int dots d dots" \int \ldots d \ldots"d. lives as an indivisible unit. Third, sans-serif d d d\boldsymbol{d}d; e.g., d d d\boldsymbol{d}d alone, or d f d f df\boldsymbol{d} fdf, or d α d α d alpha\boldsymbol{d} \boldsymbol{\alpha}dα. This is an exterior derivative, which converts a p p ppp-form into a ( p + 1 ) ( p + 1 ) (p+1)(p+1)(p+1)-form. Sometimes lightface d d ddd is used for the same purpose. Hence, d d ddd alone, or d f d f dfd fdf, or d x d x dxd xdx, is always an exterior derivative unless coupled to an sign sign int sign\int \operatorname{sign}sign (second use), or coupled to a / sign (first use).
4. The generalized Stokes theorem (see Box 4.6).
a. Let V V delV\partial \mathscr{V}V be the closed p p ppp-dimensional boundary of a ( p + 1 ) ( p + 1 ) (p+1)(p+1)(p+1)-dimensional surface V V V\mathscr{V}V. Let σ σ sigma\boldsymbol{\sigma}σ be a p p ppp-form defined throughout V V V\mathscr{V}V.
Then
T d σ = T σ T d σ = T σ int_(T)d sigma=int_(delT)sigma\int_{\mathscr{T}} d \sigma=\int_{\partial \mathscr{T}} \sigmaTdσ=Tσ
[integral of p p ppp-form σ σ sigma\sigmaσ over boundary T T delT\partial \mathscr{T}T equals integral of ( p + 1 ) ( p + 1 ) (p+1)(p+1)(p+1)-form d σ d σ d sigma\boldsymbol{d} \boldsymbol{\sigma}dσ over interior V V V\mathscr{V}V ].
b. For the sign to come out right, orientations of V V V\mathscr{V}V and V V delV\partial \mathscr{V}V must agree in this sense: choose coordinates y 0 , y 1 , , y p y 0 , y 1 , , y p y^(0),y^(1),dots,y^(p)y^{0}, y^{1}, \ldots, y^{p}y0,y1,,yp on a portion of T T T\mathscr{T}T, with y 0 y 0 y^(0)y^{0}y0 specialized so y 0 0 y 0 0 y^(0) <= 0y^{0} \leq 0y00 in V V V\mathscr{V}V, and y 0 = 0 y 0 = 0 y^(0)=0y^{0}=0y0=0 at the boundary V V delV\partial \mathscr{V}V; then the orientation
P y 0 P y 1 P y p P y 0 P y 1 P y p (delP)/(dely^(0))^^(delP)/(dely^(1))^^cdots^^(delP)/(dely^(p))\frac{\partial \mathscr{P}}{\partial y^{0}} \wedge \frac{\partial \mathscr{P}}{\partial y^{1}} \wedge \cdots \wedge \frac{\partial \mathscr{P}}{\partial y^{p}}Py0Py1Pyp
for V V V\mathscr{V}V demands the orientation
P y 1 P y p P y 1 P y p (delP)/(dely^(1))^^cdots^^(delP)/(dely^(p))\frac{\partial \mathscr{P}}{\partial y^{1}} \wedge \cdots \wedge \frac{\partial \mathscr{P}}{\partial y^{p}}Py1Pyp
for V V delV\partial \mathscr{V}V.
c. Note: For a nonorientable surface, such as a Möbius strip, where a consistent and continuous choice of orientation is impossible, more intricate mathematics is required to give a definition of " del\partial " for which the Stokes theorem holds.
Applications: Includes as special cases all integral theorems for surfaces of arbitrary dimension in spaces of arbitrary dimension, with or without metric, generaliz-
ing all versions of theorems of Stokes and Gauss. Exampes:
a. V V V\mathscr{V}V a curve, V V delV\partial \mathscr{V}V its endpoints, σ = f σ = f sigma=f\boldsymbol{\sigma}=fσ=f a 0 -form (funcion):
d f = 0 1 ( d f / d λ ) d λ = V f = f ( 1 ) f ( 0 ) . d f = 0 1 ( d f / d λ ) d λ = V f = f ( 1 ) f ( 0 ) . int_(vv)df=int_(0)^(1)(df//d lambda)d lambda=int_(del V)f=f(1)-f(0).\int_{\vee} \boldsymbol{d} f=\int_{0}^{1}(d f / d \lambda) d \lambda=\int_{\partial V} f=f(1)-f(0) .df=01(df/dλ)dλ=Vf=f(1)f(0).
b. V V V\mathscr{V}V a 2 -surface in 3 -space, V V delV\partial \mathscr{V}V its closed-curve boundary, va 1 -form; translated into Euclidean vector notation, the two integrals are
V V d v = V V ( × v ) d S ; V v = γ v d I . V V d v = V V ( × v ) d S ; V v = γ v d I . int_(VV)dv=int_(VV)(grad xx v)*dS;int_(del V)v=int_(del gamma)v*dI.\int_{V V} d v=\int_{V V}(\nabla \times v) \cdot d S ; \int_{\partial V} v=\int_{\partial \gamma} v \cdot d I .VVdv=VV(×v)dS;Vv=γvdI.
c. Other applications in § § 5.8 , 20.2 , 20.3 , 20.5 § § 5.8 , 20.2 , 20.3 , 20.5 §§5.8,20.2,20.3,20.5\S \S 5.8,20.2,20.3,20.5§§5.8,20.2,20.3,20.5, and exercises 4.10, 4.11, 5.2, and below.

D. Algebra II (applicable to any vector space with metric)

  1. Norm of a p -form.
α 2 α | i 1 i p | α α 1 i p α 2 α i 1 i p α α 1 i p ||alpha||^(2)-=alpha_(|i_(1)dotsi_(p)|)alpha^(alpha_(1)dotsi_(p))\|\boldsymbol{\alpha}\|^{2} \equiv \alpha_{\left|i_{1} \ldots i_{p}\right|} \alpha^{\alpha_{1} \ldots i_{p}}α2α|i1ip|αα1ip
Two applications: Norm of a 1-form equals its squared length, α 2 = α α α 2 = α α ||alpha||^(2)=alpha*alpha\|\boldsymbol{\alpha}\|^{2}=\boldsymbol{\alpha} \cdot \boldsymbol{\alpha}α2=αα. Norm of electromagnetic 2-form or Faraday: F 2 = B 2 E 2 F 2 = B 2 E 2 ||F||^(2)=B^(2)-E^(2)\|\boldsymbol{F}\|^{2}=\boldsymbol{B}^{2}-\boldsymbol{E}^{2}F2=B2E2.
2. Dual of a p -form.
a. In an n n nnn-dimensional space, the dual of a p p ppp-form
α α alpha\boldsymbol{\alpha}α is the ( n p ) ( n p ) (n-p)(n-p)(np)-form α α ^(**)alpha{ }^{*} \boldsymbol{\alpha}α, with components
( α ) k 1 k n p = α | i 1 i p | ϵ i 1 i p k 1 k n p . α k 1 k n p = α i 1 i p ϵ i 1 i p k 1 k n p . (^(**)alpha)_(k_(1)dotsk_(n-p))=alpha^(|i_(1)dotsi_(p)|)epsilon_(i_(1)dotsi_(p))k_(1dotsk_(n-p)).\left({ }^{*} \alpha\right)_{k_{1} \ldots k_{n-p}}=\alpha^{\left|i_{1} \ldots i_{p}\right|} \boldsymbol{\epsilon}_{i_{1} \ldots i_{p}} k_{1 \ldots k_{n-p}} .(α)k1knp=α|i1ip|ϵi1ipk1knp.
b. Properties of duals:
α = ( 1 ) p 1 α in spacetime; α α = α 2 ε in general. α = ( 1 ) p 1 α  in spacetime;  α α = α 2 ε  in general.  {:[^(****)alpha=(-1)^(p-1)alpha" in spacetime; "],[alpha^^^(**)alpha=||alpha||^(2)epsi" in general. "]:}\begin{aligned} { }^{* *} \alpha & =(-1)^{p-1} \alpha \text { in spacetime; } \\ \alpha \wedge{ }^{*} \alpha & =\|\alpha\|^{2} \varepsilon \text { in general. } \end{aligned}α=(1)p1α in spacetime; αα=α2ε in general. 
c. Note: the definition of ε ε epsi\varepsilonε (exercise 3.13) entails choosing an orientation of the space, i.e., deciding which orthonormal bases (1) are "righthanded" and thus (2) have ε ( e 1 , , e n ) = + 1 ε e 1 , , e n = + 1 epsi(e_(1),dots,e_(n))=+1\varepsilon\left(\boldsymbol{e}_{1}, \ldots, \boldsymbol{e}_{n}\right)=+1ε(e1,,en)=+1.

Applications

a. For f f fff a 0 -form, f = f ε f = f ε ^(**)f=f epsi{ }^{*} f=f \varepsilonf=fε, and f d f d int fd\int f dfd (volume) = f = f =int^(**)f=\int{ }^{*} f=f.
b. Dual of charge-current 1 -form J J J\boldsymbol{J}J is charge-current 3 -form J J ^(**)J{ }^{*} \boldsymbol{J}J. The total charge Q Q QQQ in a 3 -dimensional hypersurface region S S SSS is
Q ( S ) = S J Q ( S ) = S J Q(S)=int_(S)**JQ(S)=\int_{S} * \boldsymbol{J}Q(S)=SJ

Box 4.1 (continued)

Conservation of charge is stated locally by d J = 0 d J = 0 d^(**)J=0\boldsymbol{d}^{\boldsymbol{*}} \boldsymbol{J}=0dJ=0. Stokes' Theorem goes from this differential conservation law to the integral conservation law,
0 = q d J = T J . 0 = q d J = T J . 0=int_(q)d^(**)J=int_(del T)^(**)J.0=\int_{q} \boldsymbol{d}^{*} \boldsymbol{J}=\int_{\partial T}{ }^{*} \boldsymbol{J} .0=qdJ=TJ.
This law is of most interest when V = S 2 S 1 V = S 2 S 1 delV=S_(2)-S_(1)\partial \mathscr{V}=S_{2}-S_{1}V=S2S1 consists of the future S 2 S 2 S_(2)S_{2}S2 and past S 1 S 1 S_(1)S_{1}S1 boundaries of a spacetime region, in which case it states Q ( S 2 ) = Q S 2 = Q(S_(2))=Q\left(S_{2}\right)=Q(S2)= Q ( S 1 ) Q S 1 Q(S_(1))Q\left(S_{1}\right)Q(S1); see exercise 5.2.
c. Dual of electromagnetic field tensor F = F = F=\boldsymbol{F}=F= Faraday is F = M a x w e l l F = M a x w e l l **F=Maxwell\boldsymbol{*} \boldsymbol{F}=\boldsymbol{M a x w e l l}F=Maxwell. From the d F = 4 π J d F = 4 π J d^(**)F=4pi^(**)J\boldsymbol{d}^{*} \boldsymbol{F}=4 \pi^{*} \boldsymbol{J}dF=4πJ Maxwell equation, find 4 π Q = 4 π S J = S d F = S F 4 π Q = 4 π S J = S d F = S F 4pi Q=4piint_(S)^(**)J=int_(S)d^(**)F=int_(del S)^(**)F4 \pi Q=4 \pi \int_{S}{ }^{*} \boldsymbol{J}=\int_{S} \boldsymbol{d}^{*} \boldsymbol{F}=\int_{\partial S}{ }^{*} \boldsymbol{F}4πQ=4πSJ=SdF=SF.
3. Simple forms revisited.
a. The dual of a simple form is simple.
b. Egg crate of σ σ ^(**)sigma{ }^{*} \sigmaσ is perpendicular to egg crate of σ = α β μ σ = α β μ sigma=alpha^^beta^^cdots^^mu\sigma=\boldsymbol{\alpha} \wedge \boldsymbol{\beta} \wedge \cdots \wedge \boldsymbol{\mu}σ=αβμ in this sense:
(1) pick any vector V V V\boldsymbol{V}V lying in intersection of surfaces of σ σ sigma\sigmaσ
( α , V = β , v = = μ , V = 0 ) ( α , V = β , v = = μ , V = 0 ) ((:alpha,V:)=(:beta,v:)=cdots=(:mu,V:)=0)(\langle\boldsymbol{\alpha}, \boldsymbol{V}\rangle=\langle\boldsymbol{\beta}, \boldsymbol{v}\rangle=\cdots=\langle\boldsymbol{\mu}, \boldsymbol{V}\rangle=0)(α,V=β,v==μ,V=0)
(2) pick any vector W W W\boldsymbol{W}W lying in intersection of surfaces of σ σ ^(**)sigma{ }^{*} \sigmaσ;
(3) then V V V\boldsymbol{V}V and W W W\boldsymbol{W}W are necessarily perpendicular: v w = 0 v w = 0 v*w=0\boldsymbol{v} \cdot \boldsymbol{w}=0vw=0.
Example: σ = 3 d t σ = 3 d t sigma=3dt\boldsymbol{\sigma}=3 \boldsymbol{d} tσ=3dt is a simple 1 -form in spacetime.
a. σ = 3 d x d y d z σ = 3 d x d y d z ^(**)sigma=-3dx^^dy^^dz{ }^{*} \boldsymbol{\sigma}=-3 \boldsymbol{d} x \wedge \boldsymbol{d} y \wedge \boldsymbol{d} zσ=3dxdydz is a simple 3-form.
b. General vector in surfaces of σ σ sigma\boldsymbol{\sigma}σ is
V = V x e x + V y e y + V z e z . V = V x e x + V y e y + V z e z . V=V^(x)e_(x)+V^(y)e_(y)+V^(z)e_(z).\boldsymbol{V}=V^{x} \boldsymbol{e}_{x}+V^{y} \boldsymbol{e}_{y}+V^{z} \boldsymbol{e}_{z} .V=Vxex+Vyey+Vzez.
c. General vector in intersection of surfaces of σ σ ^(**)sigma{ }^{*} \boldsymbol{\sigma}σ is
W = W t e t . W = W t e t . W=W^(t)e_(t).\boldsymbol{W}=W^{t} \boldsymbol{e}_{t} .W=Wtet.
d. w V = 0 w V = 0 w*V=0\boldsymbol{w} \cdot \boldsymbol{V}=0wV=0.

§4.2. ELECTROMAGNETIC 2-FORM AND LORENTZ FORCE

The electromagnetic field tensor, Faraday = F = F =F=\boldsymbol{F}=F, is an antisymmetric second-rank tensor (i.e., 2 -form). Instead of expanding it in terms of the tensor products of basis 1 -forms,
F = F α β d x α d x β , F = F α β d x α d x β , F=F_(alpha beta)dx^(alpha)ox dx^(beta),\boldsymbol{F}=F_{\alpha \beta} \boldsymbol{d} x^{\alpha} \otimes \boldsymbol{d} x^{\beta},F=Fαβdxαdxβ,
the exterior calculus prefers to expand in terms of antisymmetrized tensor products ("exterior products," exercise 4.1):
(4.1) F = 1 2 F α β d x α d x β (4.2) d x α d x β d x α d x β d x β d x α (4.1) F = 1 2 F α β d x α d x β (4.2) d x α d x β d x α d x β d x β d x α {:[(4.1)F=(1)/(2)F_(alpha beta)dx^(alpha)^^dx^(beta)],[(4.2)dx^(alpha)^^dx^(beta)-=dx^(alpha)ox dx^(beta)-dx^(beta)ox dx^(alpha)]:}\begin{align*} \boldsymbol{F} & =\frac{1}{2} F_{\alpha \beta} \boldsymbol{d} x^{\alpha} \wedge \boldsymbol{d} x^{\beta} \tag{4.1}\\ \boldsymbol{d} x^{\alpha} \wedge \boldsymbol{d} x^{\beta} & \equiv \boldsymbol{d} x^{\alpha} \otimes \boldsymbol{d} x^{\beta}-\boldsymbol{d} x^{\beta} \otimes \boldsymbol{d} x^{\alpha} \tag{4.2} \end{align*}(4.1)F=12Fαβdxαdxβ(4.2)dxαdxβdxαdxβdxβdxα
Any 2-form (antisymmetric, second-rank tensor) can be so expanded. The symbol " ^^\wedge " is variously called a "wedge," a "hat," or an "exterior product sign"; and d x α d x β d x α d x β dx^(alpha)^^dx^(beta)\boldsymbol{d} x^{\alpha} \wedge \boldsymbol{d} x^{\beta}dxαdxβ are the "basis 2-forms" of a given Lorentz frame (see §3.5, exercise 3.12, and Box 4.1).
There is no simpler way to illustrate this 2-form representation of the electromagnetic field than to consider a magnetic field in the x x xxx-direction:
(4.3) F y z = F z y = B x , F = B x d y d z . (4.3) F y z = F z y = B x , F = B x d y d z . {:[(4.3)F_(yz)=-F_(zy)=B_(x)","],[F=B_(x)dy^^dz.]:}\begin{align*} F_{y z} & =-F_{z y}=B_{x}, \tag{4.3}\\ \boldsymbol{F} & =B_{x} \boldsymbol{d} y \wedge \boldsymbol{d} z . \end{align*}(4.3)Fyz=Fzy=Bx,F=Bxdydz.
The 1 -form d y = grad y d y = grad y dy=grad y\boldsymbol{d} y=\operatorname{grad} ydy=grady is the set of surfaces (actually hypersurfaces) y = 18 y = 18 y=18y=18y=18 (all t , x , z ) , y = 19 t , x , z ) , y = 19 t,x,z),y=19t, x, z), y=19t,x,z),y=19 (all t , x , z t , x , z t,x,zt, x, zt,x,z ) , y = 20 y = 20 y=20y=20y=20 (all t , x , z t , x , z t,x,zt, x, zt,x,z ), etc.; and surfaces uniformly interpolated between them. Similarly for the 1 -form d z d z dz\boldsymbol{d} zdz. The intersection between these two sets of surfaces produces a honeycomb-like structure. That structure becomes a " 2 -form" when it is supplemented by instructions (see arrows in Figure 4.1) that give a "sense of circulation" to each tube of the honeycomb (order of factors in the "wedge product" of equation 4.2; d y d z = d z d y d y d z = d z d y dy^^dz=-dz^^dy\boldsymbol{d} y \wedge \boldsymbol{d} z=-\boldsymbol{d} z \wedge \boldsymbol{d} ydydz=dzdy ). The 2-form F F F\boldsymbol{F}F in the example differs from this "basis 2-form" d y d z d y d z dy^^dz\boldsymbol{d} y \wedge \boldsymbol{d} zdydz only in this respect, that where d y d z d y d z dy^^dz\boldsymbol{d} y \wedge \boldsymbol{d} zdydz had one tube, the field 2 -form has B x B x B_(x)B_{x}Bx tubes.
When one considers a tubular structure that twists and turns on its way through spacetime, one must have more components to describe it. The 2 -form for the general electromagnetic field can be written as
F = E x d x d t + E y d y d t + E z d z d t + B x d y d z (4.4) + B y d z d x + B z d x d y F = E x d x d t + E y d y d t + E z d z d t + B x d y d z (4.4) + B y d z d x + B z d x d y {:[F=E_(x)dx^^dt+E_(y)dy^^dt+E_(z)dz^^dt+B_(x)dy^^dz],[(4.4)+B_(y)dz^^dx+B_(z)dx^^dy]:}\begin{align*} \boldsymbol{F}= & E_{x} \boldsymbol{d} x \wedge \boldsymbol{d} t+E_{y} \boldsymbol{d} y \wedge \boldsymbol{d} t+E_{z} \boldsymbol{d} z \wedge \boldsymbol{d} t+B_{x} \boldsymbol{d} y \wedge \boldsymbol{d} z \\ & +B_{y} \boldsymbol{d} z \wedge \boldsymbol{d} x+B_{z} \boldsymbol{d} x \wedge \boldsymbol{d} y \tag{4.4} \end{align*}F=Exdxdt+Eydydt+Ezdzdt+Bxdydz(4.4)+Bydzdx+Bzdxdy
( 6 components, 6 basis 2 -forms).
A 1 -form is a machine to produce a number out of a vector (bongs of a bell as the vector pierces successive surfaces). A 2 -form is a machine to produce a number out of an oriented surface (surface with a sense of circulation indicated on it: Figure 4.1, lower right). The meaning is as clear here as it is in elementary magnetism:
Electromagnetic 2-form expressed in terms of exterior products
A 2-form as a honeycomb of tubes with a sense of circulation
A 2-form as a machine to produce a number out of an oriented surface
Figure 4.1.
Construction of the 2-form for the electromagnetic field F = B x d y d z F = B x d y d z F=B_(x)dy^^dz\boldsymbol{F}=B_{x} \boldsymbol{d} y \wedge \boldsymbol{d} zF=Bxdydz out of the 1-forms d y d y dy\boldsymbol{d} ydy and d z d z dz\boldsymbol{d} zdz by "wedge multiplication" (formation of honeycomb-like structure with sense of circulation indicated by arrows). A 2 -form is a "machine to construct a number out of an oriented surface" (illustrated by sample surface enclosed by arrows at lower right; number of tubes intersected by this surface is
(this surface) F = 18 ; (this surface)  F = 18 ; int_((this surface) )F=18;\int_{\text {(this surface) }} \boldsymbol{F}=18 ;(this surface) F=18;
Faraday's concept of "magnetic flux"). This idea of 2-form machinery can be connected to the "tensor-as-machine" idea of Chapter 3 as follows. The shape of the oriented surface over which one integrates F F F\boldsymbol{F}F does not matter, for small surfaces. All that affects F F int F\int \boldsymbol{F}F is the area of the surface, and its orientation. Choose two vectors, u u u\boldsymbol{u}u and v v v\boldsymbol{v}v, that lie in the surface. They form two legs of a parallelogram, whose orientation ( u u u\boldsymbol{u}u followed by v v v\boldsymbol{v}v ) and area are embodied in the exterior product u v u v u^^v\boldsymbol{u} \wedge \boldsymbol{v}uv. Adjust the lengths of u u u\boldsymbol{u}u and v v v\boldsymbol{v}v so their parallelogram, u v u v u^^v\boldsymbol{u} \wedge \boldsymbol{v}uv, has the same area as the surface of integration. Then
Exercise: derive this result, for an infinitesimal surface u v u v u^^v\boldsymbol{u} \wedge \boldsymbol{v}uv and for general F F F\boldsymbol{F}F, using the formalism of Box 4.1.
the number of Faraday tubes cut by that surface. The electromagnetic 2-form F F F\boldsymbol{F}F or Faraday described by such a "tubular structure" (suitably abstracted; Box 4.2) has a reality and a location in space that is independent of all coordinate systems and all artificial distinctions between "electric" and "magnetic" fields. Moreover, those tubes provide the most direct geometric representation that anyone has ever been able to give for the machinery by which the electromagnetic field acts on a charged particle. Take a particle of charge e e eee and 4 -velocity
(4.5) u = d x α d τ e α (4.5) u = d x α d τ e α {:(4.5)u=(dx^(alpha))/(d tau)e_(alpha):}\begin{equation*} \boldsymbol{u}=\frac{d x^{\alpha}}{d \tau} \boldsymbol{e}_{\alpha} \tag{4.5} \end{equation*}(4.5)u=dxαdτeα
Let this particle go through a region where the electromagnetic field is described by the 2 -form
(4.6) F = B x d y d z (4.6) F = B x d y d z {:(4.6)F=B_(x)dy^^dz:}\begin{equation*} \boldsymbol{F}=B_{x} \boldsymbol{d} y \wedge \boldsymbol{d} z \tag{4.6} \end{equation*}(4.6)F=Bxdydz
of Figure 4.1. Then the force exerted on the particle (regarded as a 1 -form) is the contraction of this 2 -form with the 4 -velocity (and the charge);
(4.7) ρ ˙ = d p / d τ = e F ( u ) e F , u , (4.7) ρ ˙ = d p / d τ = e F ( u ) e F , u , {:(4.7)rho^(˙)=dp//d tau=eF(u)-=e(:F","u:)",":}\begin{equation*} \dot{\boldsymbol{\rho}}=d \boldsymbol{p} / d \tau=e \boldsymbol{F}(\boldsymbol{u}) \equiv e\langle\boldsymbol{F}, \boldsymbol{u}\rangle, \tag{4.7} \end{equation*}(4.7)ρ˙=dp/dτ=eF(u)eF,u,
as one sees by direct evaluation, letting the two factors in the 2 -form act in turn on the tangent vector u u u\boldsymbol{u}u :
ρ ˙ = e B x d y d z , u = e B x { d y d z , u d z d y , u } = e B x { d y d z , u z e z d z d y , u y e y } ρ ˙ = e B x d y d z , u = e B x { d y d z , u d z d y , u } = e B x d y d z , u z e z d z d y , u y e y {:[rho^(˙)=eB_(x)(:dy^^dz","u:)],[=eB_(x){dy(:dz","u:)-dz(:dy","u:)}],[=eB_(x){dy(:dz,u^(z)e_(z):)-dz(:dy,u^(y)e_(y):)}]:}\begin{aligned} \dot{\boldsymbol{\rho}} & =e B_{x}\langle\boldsymbol{d} y \wedge \boldsymbol{d} z, \boldsymbol{u}\rangle \\ & =e B_{x}\{\boldsymbol{d} y\langle\boldsymbol{d} z, \boldsymbol{u}\rangle-\boldsymbol{d} z\langle\boldsymbol{d} y, \boldsymbol{u}\rangle\} \\ & =e B_{x}\left\{\boldsymbol{d} y\left\langle\boldsymbol{d} z, u^{z} \boldsymbol{e}_{z}\right\rangle-\boldsymbol{d} z\left\langle\boldsymbol{d} y, u^{y} \boldsymbol{e}_{y}\right\rangle\right\} \end{aligned}ρ˙=eBxdydz,u=eBx{dydz,udzdy,u}=eBx{dydz,uzezdzdy,uyey}
or
(4.8) p ˙ α d x α = e B x u z d y e B x u y d z (4.8) p ˙ α d x α = e B x u z d y e B x u y d z {:(4.8)p^(˙)_(alpha)dx^(alpha)=eB_(x)u^(z)dy-eB_(x)u^(y)dz:}\begin{equation*} \dot{p}_{\alpha} \boldsymbol{d} x^{\alpha}=e B_{x} u^{z} \boldsymbol{d} y-e B_{x} u^{y} \boldsymbol{d} z \tag{4.8} \end{equation*}(4.8)p˙αdxα=eBxuzdyeBxuydz
Comparing coefficients of the separate basis 1 -forms on the two sides of this equation, one sees reproduced all the detail of the Lorentz force exerted by the magnetic field B x B x B_(x)B_{x}Bx :
p ˙ y = d p y d τ = e B x d z d τ , (4.9) p ˙ z = d p z d τ = e B x d y d τ . p ˙ y = d p y d τ = e B x d z d τ , (4.9) p ˙ z = d p z d τ = e B x d y d τ . {:[p^(˙)_(y)=(dp_(y))/(d tau)=eB_(x)(dz)/(d tau)","],[(4.9)p^(˙)_(z)=(dp_(z))/(d tau)=-eB_(x)(dy)/(d tau).]:}\begin{align*} & \dot{p}_{y}=\frac{d p_{y}}{d \tau}=e B_{x} \frac{d z}{d \tau}, \\ & \dot{p}_{z}=\frac{d p_{z}}{d \tau}=-e B_{x} \frac{d y}{d \tau} . \tag{4.9} \end{align*}p˙y=dpydτ=eBxdzdτ,(4.9)p˙z=dpzdτ=eBxdydτ.
By simple extension of this line of reasoning to the general electromagnetic field, one concludes that the time-rate of change of momentum ( 1 -form) is equal to the charge multiplied by the contraction of the Faraday with the 4-velocity. Figure 4.2 illustrates pictorially how the 2 -form, F F F\boldsymbol{F}F, serves as a machine to produce the 1 -form, p ˙ p ˙ p^(˙)\dot{\boldsymbol{p}}p˙, out of the tangent vector, eu.

Box 4.2 ABSTRACTING A 2-FORM FROM THE CONCEPT OF "HONEYCOMBLIKE STRUCTURE," IN 3-SPACE AND IN SPACETIME

Open up a cardboard carton containing a dozen bottles, and observe the honeycomb structure of intersecting north-south and east-west cardboard separators between the bottles. That honeycomb structure of "tubes" ("channels for bottles") is a fairly apt illustration of a 2 -form in the context of everyday 3 -space. It yields a number (number of tubes cut) for each choice of smooth element of 2 -surface slicing through the three-dimensional structure. However, the intersecting cardboard separators are rather too specific. All that a true 2 -form can ever give is the number of tubes sliced through, not the "shape" of the tubes. Slew the carton around on the floor by 45 45 45^(@)45^{\circ}45. Then half the separators run NW-SE and the other half run NE-SW, but through a given bit of 2 -surface fixed in 3 -space the count of tubes is unchanged. Therefore, one should be careful to make the concept of tubes in the mind's eye abstract enough that one envisages direction of tubes (vertical in the example) and density of tubes, but not any specific location or orientation for the tube walls. Thus all the following representations give one and the same 2-form, σ σ sigma\boldsymbol{\sigma}σ :
σ = B d x d y σ = B ( 2 d x ) ( 1 2 d y ) σ = B d x d y σ = B ( 2 d x ) 1 2 d y {:[sigma=Bdx^^dy],[sigma=B(2dx)^^((1)/(2)dy)]:}\begin{aligned} & \boldsymbol{\sigma}=B \boldsymbol{d} x \wedge \boldsymbol{d} y \\ & \boldsymbol{\sigma}=B(2 \boldsymbol{d} x) \wedge\left(\frac{1}{2} \boldsymbol{d} y\right) \end{aligned}σ=Bdxdyσ=B(2dx)(12dy)
(NS cardboards spaced twice as close as before; EW cardboards spaced twice as wide as before);
σ = B d ( x y 2 ) d ( x + y 2 ) (cardboards rotated through 45 ) σ = B α d x + β d y ( α δ β γ ) 1 / 2 γ d x + δ d y ( α δ β γ ) 1 / 2 σ = B d x y 2 d x + y 2  (cardboards rotated through 45  ) σ = B α d x + β d y ( α δ β γ ) 1 / 2 γ d x + δ d y ( α δ β γ ) 1 / 2 {:[sigma=Bd((x-y)/(sqrt2))^^d((x+y)/(sqrt2))],[" (cardboards rotated through 45 ")],[sigma=B(alpha dx+beta dy)/((alpha delta-beta gamma)^(1//2))^^(gamma dx+delta dy)/((alpha delta-beta gamma)^(1//2))]:}\begin{gathered} \boldsymbol{\sigma}=B \boldsymbol{d}\left(\frac{x-y}{\sqrt{2}}\right) \wedge \boldsymbol{d}\left(\frac{x+y}{\sqrt{2}}\right) \\ \text { (cardboards rotated through 45 }) \\ \boldsymbol{\sigma}=B \frac{\alpha \boldsymbol{d} x+\beta \boldsymbol{d} y}{(\alpha \delta-\beta \gamma)^{1 / 2}} \wedge \frac{\gamma \boldsymbol{d} x+\delta \boldsymbol{d} y}{(\alpha \delta-\beta \gamma)^{1 / 2}} \end{gathered}σ=Bd(xy2)d(x+y2) (cardboards rotated through 45 )σ=Bαdx+βdy(αδβγ)1/2γdx+δdy(αδβγ)1/2
(both orientation and spacing of "cardboards" changing from point to point, with all four
functions, α , β , γ α , β , γ alpha,beta,gamma\alpha, \beta, \gammaα,β,γ, and δ δ delta\deltaδ, depending on position).
What has physical reality, and constitutes the real geometric object, is not any one of the 1 -forms just encountered individually, but only the 2 -form σ σ sigma\boldsymbol{\sigma}σ itself. This circumstance helps to explain why in the physical literature one sometimes refers to "tubes of force" and sometimes to "lines of force." The two terms for the same structure have this in common, that each yields a number when sliced by a bit of surface. The line-of-force picture has the advantage of not imposing on the mind any specific structure of "sheets of cardboard"; that is, any specific decomposition of the 2 -form into the product of 1 -forms. However, that very feature is also a disadvantage, for in a calculation one often finds it useful to have a well-defined representation of the 2 -form as the wedge product of 1 -forms. Moreover, the tube picture, abstract though it must be if it is to be truthful, also has this advantage, that the orientation of the elementary tubes (sense of circulation as indicated by arrows in Figures 4.1 and 4.5 , for example) lends itself to ready visualization. Let the "walls" of the tubes therefore remain in all pictures drawn in this book as a reminder that 2 -forms can be built out of 1 -forms; but let it be understood here and hereafter how manyfold are the options for the individual 1-forms!
Turn now from three dimensions to four, and find that the concept of "honeycomb-like structure" must be made still more abstract. In three dimensions the arbitrariness of the decomposition of the 2 -form into 1 -forms showed in the slant and packing of the "cardboards," but had no effect on the verticality of the "channels for the bottles" ("direction of Faraday lines of force or tubes of
force"); not so in four dimensions, or at least not in the generic case in four dimensions.
In special cases, the story is almost as simple in four dimensions as in three. An example of a special case is once again the 2 -form σ = B d x σ = B d x sigma=Bdx\sigma=B \boldsymbol{d} xσ=Bdx d y d y ^^dy\wedge \boldsymbol{d} ydy, with all the options for decomposition into 1 -forms that have already been mentioned, but with every option giving the same "direction" for the tubes. If the word "direction" now rises in status from "tube walls unpierced by motion in the direction of increasing z z zzz " to "tube walls unpierced either by motion in the direction of increasing z z zzz, or by motion in the direction of increasing t t ttt, or by any linear combination of such motions," that is a natural enough consequence of adding the new dimension. Moreover, the same simplicity prevails for an electromagnetic plane wave. For example, let the wave be advancing in the z z zzz-direction, and let the electric polarization point in the x x xxx-direction; then for a monochromatic wave, one has
E x = B y = E 0 cos ω ( z t ) = F 01 = F 31 , E x = B y = E 0 cos ω ( z t ) = F 01 = F 31 , E_(x)=B_(y)=E_(0)cos omega(z-t)=-F_(01)=F_(31),E_{x}=B_{y}=E_{0} \cos \omega(z-t)=-F_{01}=F_{31},Ex=By=E0cosω(zt)=F01=F31,
and all components distinct from these equal zero.
Faraday is
F = F 01 d t d x + F 31 d z d x = E 0 cos ω ( z t ) d ( z t ) d x F = F 01 d t d x + F 31 d z d x = E 0 cos ω ( z t ) d ( z t ) d x {:[F=F_(01)dt^^dx+F_(31)dz^^dx],[=E_(0)cos omega(z-t)d(z-t)^^dx]:}\begin{aligned} \boldsymbol{F} & =F_{01} \boldsymbol{d} t \wedge \boldsymbol{d} x+F_{31} \boldsymbol{d} z \wedge \boldsymbol{d} x \\ & =E_{0} \cos \omega(z-t) \boldsymbol{d}(z-t) \wedge \boldsymbol{d} x \end{aligned}F=F01dtdx+F31dzdx=E0cosω(zt)d(zt)dx
which is again representable as a single wedge product of two 1 -forms.
Not so in general! The general 2 -form in four dimensions consists of six distinct wedge products,
F = F 01 d t d x + F 02 d t d y + + F 23 d y d z F = F 01 d t d x + F 02 d t d y + + F 23 d y d z {:[F=F_(01)dt^^dx+F_(02)dt^^dy+cdots],[+F_(23)dy^^dz]:}\begin{aligned} \boldsymbol{F}=F_{01} \boldsymbol{d} t \wedge \boldsymbol{d} x+F_{02} \boldsymbol{d} t \wedge \boldsymbol{d} y & +\cdots \\ & +F_{23} \boldsymbol{d} y \wedge \boldsymbol{d} z \end{aligned}F=F01dtdx+F02dtdy++F23dydz
It is too much to hope that this expression will reduce in the generic case to a single wedge product of two 1 -forms ("simple" 2 -form). It is not even
true that it will. It is only remarkable that it can be reduced from six exterior products to two (details in exercise 4.1); thus,
F = π 1 ξ 1 + π 2 ξ 2 F = π 1 ξ 1 + π 2 ξ 2 F=pi^(1)^^xi^(1)+pi^(2)^^xi^(2)\boldsymbol{F}=\boldsymbol{\pi}^{1} \wedge \xi^{1}+\boldsymbol{\pi}^{2} \wedge \xi^{2}F=π1ξ1+π2ξ2
Each product π i ξ i π i ξ i pi^(i)^^xi^(i)\boldsymbol{\pi}^{i} \wedge \xi^{i}πiξi individually can be visualized as a honeycomb-like structure like those depicted in Figures 4.1, 4.2, 4.4, and 4.5. Each such structure individually can be pictured as built out of intersecting sheets (1-forms), but with such details as the tilt and packing of these 1 -forms abstracted away. Each such structure individually gives a number when sliced by an element of surface. What counts for the 2 -form F F F\boldsymbol{F}F, however, is neither the number of tubes of π 1 ξ 1 π 1 ξ 1 pi^(1)^^xi^(1)\boldsymbol{\pi}^{1} \wedge \xi^{1}π1ξ1 cut by the surface, nor the number of tubes of n 2 ξ 2 n 2 ξ 2 n^(2)^^xi^(2)\boldsymbol{n}^{2} \wedge \xi^{2}n2ξ2 cut by the surface, but only the sum of the two. This sum is what is referred to in the text as the "number of tubes of F F F\boldsymbol{F}F " cut by the surface. The contribution of either wedge product individually is not well-defined, for a simple reason: the decomposition of a six-wedge-product object into two wedge products, miraculous though it seems, is actually far from unique (details in exercise 4.2).
In keeping with the need to have two products of 1 -forms to represent the general 2 -form note that the vanishing of d F d F dF\boldsymbol{d F}dF ("no magnetic charges") does not automatically imply that d ( n 1 ξ 1 ) d n 1 ξ 1 d(n^(1)^^xi^(1))\boldsymbol{d}\left(\boldsymbol{n}^{1} \wedge \xi^{1}\right)d(n1ξ1) or d ( n 2 ξ 2 ) d n 2 ξ 2 d(n^(2)^^xi^(2))\boldsymbol{d}\left(\boldsymbol{n}^{2} \wedge \xi^{2}\right)d(n2ξ2) separately vanish. Note also that any spacelike slice through the general 2 -form F F F\boldsymbol{F}F (reduction from four dimensions to three) can always be represented in terms of a honeycomb-like structure ("simple" 2 -form in three dimensions; Faraday's picture of magnetic tubes of force).
Despite the abstraction that has gone on in seeing in all generality what a 2 -form is, there is no bar to continuing to use the term "honeycomb-like structure" in a broadened sense to describe this object; and that is the practice here and hereafter.
Figure 4.2.
The Faraday or 2 -form F F F\boldsymbol{F}F of the electromagnetic field is a machine to produce a 1 -form (the time-rate of change of momentum p ˙ p ˙ p^(˙)\dot{\boldsymbol{p}}p˙ of a charged particle) out of a tangent vector (product of charge e e eee of the particle and its 4 -velocity u u u\boldsymbol{u}u ). In spacetime the general 2 -form is the "superposition" (see Box 4.2 ) of two structures like that illustrated at the top of this diagram, the tubes of the first being tilted and packed as indicated, the tubes of the second being tilted in another direction and having a different packing density.

§4.3. FORMS ILLUMINATE ELECTROMAGNETISM, AND ELECTROMAGNETISM ILLUMINATES FORMS

All electromagnetism allows itself to be summarized in the language of 2-forms, honeycomb-like "structures" (again in the abstract sense of "structure" of Box 4.2) of tubes filling all spacetime, as well when spacetime is curved as when it is flat. In brief, there are two such structures, one Faraday = F = F =F=\boldsymbol{F}=F, the other Maxwell = F = F =^(**F)={ }^{\boldsymbol{*} F}=F, each dual ("perpendicular," the only place where metric need enter the discussion) to the other, each satisfying an elementary equation:
(4.10) d F = 0 (4.10) d F = 0 {:(4.10)dF=0:}\begin{equation*} \boldsymbol{d} \boldsymbol{F}=0 \tag{4.10} \end{equation*}(4.10)dF=0
("no tubes of Faraday ever end") and
(4.11) d F = 4 π J (4.11) d F = 4 π J {:(4.11)d^(**)F=4pi^(**)J:}\begin{equation*} \boldsymbol{d}^{*} \boldsymbol{F}=4 \pi^{*} \boldsymbol{J} \tag{4.11} \end{equation*}(4.11)dF=4πJ
("the number of tubes of Maxwell that end in an elementary volume is equal to the amount of electric charge in that volume"). To see in more detail how this machinery shows up in action, look in turn at: (1) the definition of a 2 -form; (2) the appearance of a given electromagnetic field as Faraday and as Maxwell; (3) the Maxwell structure for a point-charge at rest; (4) the same for a point-charge in motion; (5) the nature of the field of a charge that moves uniformly except during a brief instant of acceleration; (6) the Faraday structure for the field of an oscillating dipole; (7) the concept of exterior derivative; (8) Maxwell's equations in the language of forms; and (9) the solution of Maxwell's equations in flat spacetime, using a 1 -form A A A\boldsymbol{A}A from which the Liénard-Wiechert 2 -form F F F\boldsymbol{F}F can be calculated via F = d A F = d A F=dA\boldsymbol{F}=\boldsymbol{d} \boldsymbol{A}F=dA.
A 2 -form, as illustrated in Figure 4.1, is a machine to construct a number ("net number of tubes cut") out of any "oriented 2 -surface" (2-surface with "sense of circulation" marked on it):
(4.12) ( number of tubes cut ) = surface F . (4.12)  number   of tubes   cut  = surface  F {:(4.12)([" number "],[" of tubes "],[" cut "])=int_("surface ")F". ":}\left(\begin{array}{l} \text { number } \tag{4.12}\\ \text { of tubes } \\ \text { cut } \end{array}\right)=\int_{\text {surface }} \boldsymbol{F} \text {. }(4.12)( number  of tubes  cut )=surface F
For example, let the 2-form be the one illustrated in Figure 4.1
F = B x d y d z F = B x d y d z F=B_(x)dy^^dz\boldsymbol{F}=B_{x} \boldsymbol{d} y \wedge \boldsymbol{d} zF=Bxdydz
and let the surface of integration be the portion of the surface of the 2 -sphere x 2 + y 2 + z 2 = a 2 , t = x 2 + y 2 + z 2 = a 2 , t = x^(2)+y^(2)+z^(2)=a^(2),t=x^{2}+y^{2}+z^{2}=a^{2}, t=x2+y2+z2=a2,t= constant, bounded between θ = 70 θ = 70 theta=70^(@)\theta=70^{\circ}θ=70 and θ = 110 θ = 110 theta=110^(@)\theta=110^{\circ}θ=110 and between φ = 0 φ = 0 varphi=0^(@)\varphi=0^{\circ}φ=0 and φ = 90 φ = 90 varphi=90^(@)\varphi=90^{\circ}φ=90 ("Atlantic region of the tropics"). Write
y = a sin θ sin φ , z = a cos θ , d y = a ( cos θ sin φ d θ + sin θ cos φ d φ ) , d z = a sin θ d θ , (4.13) d y d z = a 2 sin 2 θ cos φ d θ d φ . y = a sin θ sin φ , z = a cos θ , d y = a ( cos θ sin φ d θ + sin θ cos φ d φ ) , d z = a sin θ d θ , (4.13) d y d z = a 2 sin 2 θ cos φ d θ d φ . {:[y=a sin theta sin varphi","],[z=a cos theta","],[dy=a(cos theta sin varphi d theta+sin theta cos varphi d varphi)","],[dz=-a sin theta d theta","],[(4.13)dy^^dz=a^(2)sin^(2)theta cos varphi d theta^^d varphi.]:}\begin{align*} y & =a \sin \theta \sin \varphi, \\ z & =a \cos \theta, \\ \boldsymbol{d} y & =a(\cos \theta \sin \varphi \boldsymbol{d} \theta+\sin \theta \cos \varphi \boldsymbol{d} \varphi), \\ \boldsymbol{d} z & =-a \sin \theta \boldsymbol{d} \theta, \\ \boldsymbol{d} y \wedge \boldsymbol{d} z & =a^{2} \sin ^{2} \theta \cos \varphi \boldsymbol{d} \theta \wedge \boldsymbol{d} \varphi . \tag{4.13} \end{align*}y=asinθsinφ,z=acosθ,dy=a(cosθsinφdθ+sinθcosφdφ),dz=asinθdθ,(4.13)dydz=a2sin2θcosφdθdφ.
Preview of key points in electromagnetism
A 2-form as machine for number of tubes cut
Number of tubes cut calculated in one example
Figure 4.3.
Spacelike slices through Faraday, the electromagnetic 2-form, a geometric object, a honeycomb of tubes that pervades all spacetime ("honeycomb" in the abstract sense spelled out more precisely in Box 4.2). The surfaces in the drawing do not look like a 2 -form (honeycomb), because the second family of surfaces making up the honeycomb extends in the spatial direction that is suppressed from the drawing. Diagram A shows one spacelike slice through the 2 -form (time increases upwards in the diagram). In diagram B, a projection of the 2 -form on this spacelike hypersurface gives the Faraday tubes of magnetic force in this three-dimensional geometry (if the suppressed dimension were restored, the tubes would be tubes, not channels between lines). Diagram C shows another spacelike slice (hypersurface of simultaneity for an observer in a different Lorentz frame). Diagram D shows the very different pattern of magnetic tubes in this reference system. The demand that magnetic tubes of force shall not end ( B = 0 ) ( B = 0 ) (grad*B=0)(\boldsymbol{\nabla} \cdot \boldsymbol{B}=0)(B=0), repeated over and over for every spacelike slice through Faraday, gives everywhere the result B / t = × E B / t = × E del B//del t=-grad xx E\partial \boldsymbol{B} / \partial t=-\boldsymbol{\nabla} \times \boldsymbol{E}B/t=×E. Thus (magnetostatics) + (covariance) longrightarrow\longrightarrow (magnetodynamics). Similarly-see Chapters 17 and 21(geometrostatics) + (covariance) longrightarrow\longrightarrow (geometrodynamics).
The structure d θ d θ d θ d θ d theta^^d theta\boldsymbol{d} \theta \wedge \boldsymbol{d} \thetadθdθ looks like a "collapsed egg-crate" (Figure 1.4, upper right) and has zero content, a fact formally evident from the vanishing of α β = α β = alpha^^beta=\alpha \wedge \beta=αβ= β α β α -beta^^alpha-\boldsymbol{\beta} \wedge \boldsymbol{\alpha}βα when α α alpha\boldsymbol{\alpha}α and β β beta\boldsymbol{\beta}β are identical. The result of the integration, assuming constant B x B x B_(x)B_{x}Bx, is
(4.14) surface F = a 2 B x 70 110 sin 2 θ d θ 0 90 cos φ d φ (4.14) surface  F = a 2 B x 70 110 sin 2 θ d θ 0 90 cos φ d φ {:(4.14)int_("surface ")F=a^(2)B_(x)int_(70^(@))^(110^(@))sin^(2)theta d thetaint_(0^(@))^(90^(@))cos varphi d varphi:}\begin{equation*} \int_{\text {surface }} \boldsymbol{F}=a^{2} B_{x} \int_{70^{\circ}}^{110^{\circ}} \sin ^{2} \theta d \theta \int_{0^{\circ}}^{90^{\circ}} \cos \varphi d \varphi \tag{4.14} \end{equation*}(4.14)surface F=a2Bx70110sin2θdθ090cosφdφ
It is not so easy to visualize a pure electric field by means of its 2 -form F F F\boldsymbol{F}F (Figure 4.4 , left) as it is to visualize a pure magnetic field by means of its 2 -form F F F\boldsymbol{F}F (Figures 4.1, 4.2, 4.3). Is there not some way to treat the two fields on more nearly the same footing? Yes, construct the 2 -form * F F F\boldsymbol{F}F (Figure 4.4, right) that is dual ("perpendicular"; Box 4.3; exercise 3.14) to F F F\boldsymbol{F}F.
Figure 4.4.
The Faraday structure
F = 1 2 F μ ν d x μ d x y = 1 2 F 01 d t d x + 1 2 F 10 d x d t = E x d x d t F = 1 2 F μ ν d x μ d x y = 1 2 F 01 d t d x + 1 2 F 10 d x d t = E x d x d t F=(1)/(2)F_(mu nu)dx^(mu)^^dx^(y)=(1)/(2)F_(01)dt^^dx+(1)/(2)F_(10)dx^^dt=E_(x)dx^^dt\boldsymbol{F}=\frac{1}{2} F_{\mu \nu} \boldsymbol{d} x^{\mu} \wedge \boldsymbol{d} x^{y}=\frac{1}{2} F_{01} \boldsymbol{d} t \wedge \boldsymbol{d} x+\frac{1}{2} F_{10} \boldsymbol{d} x \wedge \boldsymbol{d} t=E_{x} \boldsymbol{d} x \wedge \boldsymbol{d} tF=12Fμνdxμdxy=12F01dtdx+12F10dxdt=Exdxdt
associated with an electric field in the x x xxx-direction, and the dual ("perpendicular") Maxwell honeycomblike 2 -form
F = 1 2 F μ ν d x μ d x y = F 23 d x 2 d x 3 = F 01 d x 2 d x 3 = F 10 d x 2 d x 3 = E x d y d z F = 1 2 F μ ν d x μ d x y = F 23 d x 2 d x 3 = F 01 d x 2 d x 3 = F 10 d x 2 d x 3 = E x d y d z ^(**)F=(1)/(2)^(**)F_(mu nu)dx^(mu)^^dx^(y)=^(**)F_(23)dx^(2)^^dx^(3)=F^(01)dx^(2)^^dx^(3)=F_(10)dx^(2)^^dx^(3)=E_(x)dy^^dz{ }^{*} \boldsymbol{F}=\frac{1}{2}{ }^{*} F_{\mu \nu} \boldsymbol{d} x^{\mu} \wedge \boldsymbol{d} x^{y}={ }^{*} F_{23} \boldsymbol{d} x^{2} \wedge \boldsymbol{d} x^{3}=F^{01} \boldsymbol{d} x^{2} \wedge \boldsymbol{d} x^{3}=F_{10} \boldsymbol{d} x^{2} \wedge \boldsymbol{d} x^{3}=E_{x} \boldsymbol{d} y \wedge \boldsymbol{d} zF=12Fμνdxμdxy=F23dx2dx3=F01dx2dx3=F10dx2dx3=Exdydz
Represent in geometric form the field of a point-charge of strength e e eee at rest at the origin. Operate in flat space with spherical polar coordinates:
d s 2 = d τ 2 = g μ ν d x μ d x ν (4.15) = d t 2 + d r 2 + r 2 d θ 2 + r 2 sin 2 θ d φ 2 d s 2 = d τ 2 = g μ ν d x μ d x ν (4.15) = d t 2 + d r 2 + r 2 d θ 2 + r 2 sin 2 θ d φ 2 {:[ds^(2)=-dtau^(2)=g_(mu nu)dx^(mu)dx^(nu)],[(4.15)=-dt^(2)+dr^(2)+r^(2)dtheta^(2)+r^(2)sin^(2)theta dvarphi^(2)]:}\begin{align*} d s^{2} & =-d \tau^{2}=g_{\mu \nu} d x^{\mu} d x^{\nu} \\ & =-d t^{2}+d r^{2}+r^{2} d \theta^{2}+r^{2} \sin ^{2} \theta d \varphi^{2} \tag{4.15} \end{align*}ds2=dτ2=gμνdxμdxν(4.15)=dt2+dr2+r2dθ2+r2sin2θdφ2
The electric field in the r r rrr-direction being E r = e / r 2 E r = e / r 2 E_(r)=e//r^(2)E_{r}=e / r^{2}Er=e/r2, it follows that the 2 -form F F F\boldsymbol{F}F or Faraday is
(4.16) F = 1 2 F μ ν d x μ d x ν = E r d t d r = e r 2 d t d r (4.16) F = 1 2 F μ ν d x μ d x ν = E r d t d r = e r 2 d t d r {:(4.16)F=(1)/(2)F_(mu nu)dx^(mu)^^dx^(nu)=-E_(r)dt^^dr=-(e)/(r^(2))dt^^dr:}\begin{equation*} \boldsymbol{F}=\frac{1}{2} F_{\mu \nu} \boldsymbol{d} x^{\mu} \wedge \boldsymbol{d} x^{\nu}=-E_{r} \boldsymbol{d} t \wedge \boldsymbol{d} r=-\frac{e}{r^{2}} \boldsymbol{d} t \wedge \boldsymbol{d} r \tag{4.16} \end{equation*}(4.16)F=12Fμνdxμdxν=Erdtdr=er2dtdr
Its dual, according to the prescription in exercise 3.14, is Maxwell:
(4.17) Maxwell = F = e sin θ d θ d φ (4.17)  Maxwell  = F = e sin θ d θ d φ {:(4.17)" Maxwell "=^(**)F=e sin theta d theta^^d varphi:}\begin{equation*} \text { Maxwell }={ }^{*} \boldsymbol{F}=e \sin \theta \boldsymbol{d} \theta \wedge \boldsymbol{d} \varphi \tag{4.17} \end{equation*}(4.17) Maxwell =F=esinθdθdφ
Pattern of tubes in dual structure Maxwell for point-charge at rest
as illustrated in Figure 4.5.
Take a tour in the positive sense around a region of the surface of the sphere illustrated in Figure 4.5. The number of tubes of F F **F* \boldsymbol{F}F encompassed in the route will be precisely
( number of tubes ) = e ( solid angle ) (  number   of tubes  ) = e (  solid   angle  ) ((" number ")/(" of tubes "))=e((" solid ")/(" angle "))\binom{\text { number }}{\text { of tubes }}=e\binom{\text { solid }}{\text { angle }}( number  of tubes )=e( solid  angle )
The whole number of tubes of F F **F\boldsymbol{*} \boldsymbol{F}F emergent over the entire sphere will be 4 π e 4 π e 4pi e4 \pi e4πe, in conformity with Faraday's picture of tubes of force.

Box 4.3 DUALITY OF 2-FORMS IN SPACETIME

Given a general 2-form (containing six exterior or wedge products)
F = E x d x d t + E y d y d t + + B z d x d y F = E x d x d t + E y d y d t + + B z d x d y F=E_(x)dx^^dt+E_(y)dy^^dt+cdots+B_(z)dx^^dy\boldsymbol{F}=E_{x} \boldsymbol{d} x \wedge \boldsymbol{d} t+E_{y} \boldsymbol{d} y \wedge \boldsymbol{d} t+\cdots+B_{z} \boldsymbol{d} x \wedge \boldsymbol{d} yF=Exdxdt+Eydydt++Bzdxdy
one gets to its dual ("perpendicular") by the prescription
F = B x d x d t + E y d z d x + E z d x d y F = B x d x d t + E y d z d x + E z d x d y **F=-B_(x)dx^^dt-cdots+E_(y)dz^^dx+E_(z)dx^^dy* \boldsymbol{F}=-B_{x} \boldsymbol{d} x \wedge \boldsymbol{d} t-\cdots+E_{y} \boldsymbol{d} z \wedge \boldsymbol{d} x+E_{z} \boldsymbol{d} x \wedge \boldsymbol{d} yF=Bxdxdt+Eydzdx+Ezdxdy

Duality Rotations

Note that the dual of the dual is the negative of the original 2 -form; thus
F = E x d x d t B z d x d y = F F = E x d x d t B z d x d y = F ^(****)F=-E_(x)dx^^dt-cdots-B_(z)dx^^dy=-F{ }^{* *} \boldsymbol{F}=-E_{x} \boldsymbol{d} x \wedge \boldsymbol{d} t-\cdots-B_{z} \boldsymbol{d} x \wedge \boldsymbol{d} y=-\boldsymbol{F}F=ExdxdtBzdxdy=F
In this sense ^(**){ }^{*} has the same property as the imaginary number i : = i i = 1 i : = i i = 1 i:^(****)=ii=-1i:^{* *}=i i=-1i:=ii=1. Thus one can write
e α = cos α + sin α e α = cos α + sin α e^(**alpha)=cos alpha+^(**)sin alphae^{* \alpha}=\cos \alpha+{ }^{*} \sin \alphaeα=cosα+sinα
This operation, applied to F F F\boldsymbol{F}F, carries attention from the generic 2 -form in its simplest representation (see exercise 4.1)
F = E x d x d t + B x d y d z F = E x d x d t + B x d y d z F=E_(x)dx^^dt+B_(x)dy^^dz\boldsymbol{F}=E_{x} \boldsymbol{d} x \wedge \boldsymbol{d} t+B_{x} \boldsymbol{d} y \wedge \boldsymbol{d} zF=Exdxdt+Bxdydz
to another "duality rotated electromagnetic field"
e α F = ( E x cos α B x sin α ) d x d t + ( B x cos α + E x sin α ) d y d z e α F = E x cos α B x sin α d x d t + B x cos α + E x sin α d y d z e^(**alpha)F=(E_(x)cos alpha-B_(x)sin alpha)dx^^dt+(B_(x)cos alpha+E_(x)sin alpha)dy^^dze^{* \alpha} \boldsymbol{F}=\left(E_{x} \cos \alpha-B_{x} \sin \alpha\right) \boldsymbol{d} x \wedge \boldsymbol{d} t+\left(B_{x} \cos \alpha+E_{x} \sin \alpha\right) \boldsymbol{d} y \wedge \boldsymbol{d} zeαF=(ExcosαBxsinα)dxdt+(Bxcosα+Exsinα)dydz
If the original field satisfied Maxwell's empty-space field equations, so does the new field. With suitable choice of the "complexion" α α alpha\alphaα, one can annul one of the two wedge products at any chosen point in spacetime and have for the other
( B x 2 + E x 2 ) 1 / 2 d y d z B x 2 + E x 2 1 / 2 d y d z (B_(x)^(2)+E_(x)^(2))^(1//2)dy^^dz\left(B_{x}^{2}+E_{x}^{2}\right)^{1 / 2} \boldsymbol{d} y \wedge \boldsymbol{d} z(Bx2+Ex2)1/2dydz
Field of a point-charge in motion
How can one determine the structure of tubes associated with a charged particle moving at a uniform velocity? First express * F F F\boldsymbol{F}F in rectangular coordinates moving with the particle (barred coordinates in this comoving "rocket" frame of reference; unbarred coordinates will be used later for a laboratory frame of reference). The relevant steps can be listed:
(a)
F = e sin θ ¯ d θ ¯ d φ ¯ = e ( d cos θ ¯ ) d φ ¯ F = e sin θ ¯ d θ ¯ d φ ¯ = e ( d cos θ ¯ ) d φ ¯ ^(**)F=e sin bar(theta)d bar(theta)^^d bar(varphi)=-e(d cos bar(theta))^^d bar(varphi){ }^{*} \boldsymbol{F}=e \sin \bar{\theta} \boldsymbol{d} \bar{\theta} \wedge \boldsymbol{d} \bar{\varphi}=-e(\boldsymbol{d} \cos \bar{\theta}) \wedge \boldsymbol{d} \bar{\varphi}F=esinθ¯dθ¯dφ¯=e(dcosθ¯)dφ¯
Figure 4.5.
The field of 2-forms Maxwell = F = e sin θ d θ d ϕ = F = e sin θ d θ d ϕ =^(**)F=e sin theta d theta^^d phi={ }^{*} \boldsymbol{F}=e \sin \theta \boldsymbol{d} \theta \wedge \boldsymbol{d} \phi=F=esinθdθdϕ that describes the electromagnetic field of a charge e e eee at rest at the origin. This picture is actually the intersection of **\boldsymbol{*} with a 3 -surface of constant time t t ttt; i.e., the time direction is suppressed from the picture.
(b)
φ ¯ = arctan y ¯ x ¯ ; d φ ¯ = x ¯ d y ¯ y ¯ d x ¯ x ¯ 2 + y ¯ 2 ; φ ¯ = arctan y ¯ x ¯ ; d φ ¯ = x ¯ d y ¯ y ¯ d x ¯ x ¯ 2 + y ¯ 2 ; bar(varphi)=arctan((( bar(y)))/(( bar(x))));quad d bar(varphi)=(( bar(x))d( bar(y))-( bar(y))d( bar(x)))/( bar(x)^(2)+ bar(y)^(2));\bar{\varphi}=\arctan \frac{\bar{y}}{\bar{x}} ; \quad \boldsymbol{d} \bar{\varphi}=\frac{\bar{x} \boldsymbol{d} \bar{y}-\bar{y} \boldsymbol{d} \bar{x}}{\bar{x}^{2}+\bar{y}^{2}} ;φ¯=arctany¯x¯;dφ¯=x¯dy¯y¯dx¯x¯2+y¯2;
(c)
cos θ ¯ = z ¯ r ¯ ; d ( cos θ ¯ ) = d z ¯ r ¯ + z ¯ r ¯ 3 ( x ¯ d x ¯ + y ¯ d y ¯ + z ¯ d z ¯ ) cos θ ¯ = z ¯ r ¯ ; d ( cos θ ¯ ) = d z ¯ r ¯ + z ¯ r ¯ 3 ( x ¯ d x ¯ + y ¯ d y ¯ + z ¯ d z ¯ ) cos bar(theta)=(( bar(z)))/(( bar(r)));quad-d(cos bar(theta))=(-d( bar(z)))/(( bar(r)))+(( bar(z)))/( bar(r)^(3))( bar(x)d bar(x)+ bar(y)d bar(y)+ bar(z)d bar(z))\cos \bar{\theta}=\frac{\bar{z}}{\bar{r}} ; \quad-\boldsymbol{d}(\cos \bar{\theta})=\frac{-\boldsymbol{d} \bar{z}}{\bar{r}}+\frac{\bar{z}}{\bar{r}^{3}}(\bar{x} \boldsymbol{d} \bar{x}+\bar{y} \boldsymbol{d} \bar{y}+\bar{z} \boldsymbol{d} \bar{z})cosθ¯=z¯r¯;d(cosθ¯)=dz¯r¯+z¯r¯3(x¯dx¯+y¯dy¯+z¯dz¯)
(d) combine to find
(4.18) F = ( e / r ¯ 3 ) ( x ¯ d y ¯ d z ¯ + y ¯ d z ¯ d x ¯ + z ¯ d x ¯ d y ¯ ) (4.18) F = e / r ¯ 3 ( x ¯ d y ¯ d z ¯ + y ¯ d z ¯ d x ¯ + z ¯ d x ¯ d y ¯ ) {:(4.18)^(**)F=(e// bar(r)^(3))( bar(x)d bar(y)^^d bar(z)+ bar(y)d bar(z)^^d bar(x)+ bar(z)d bar(x)^^d bar(y)):}\begin{equation*} { }^{*} \boldsymbol{F}=\left(e / \bar{r}^{3}\right)(\bar{x} \boldsymbol{d} \bar{y} \wedge \boldsymbol{d} \bar{z}+\bar{y} \boldsymbol{d} \bar{z} \wedge \boldsymbol{d} \bar{x}+\bar{z} \boldsymbol{d} \bar{x} \wedge \boldsymbol{d} \bar{y}) \tag{4.18} \end{equation*}(4.18)F=(e/r¯3)(x¯dy¯dz¯+y¯dz¯dx¯+z¯dx¯dy¯)
(electromagnetic field of point charge in a comoving Cartesian system; spherically symmetric). Now transform to laboratory coordinates:
velocity parameter α α alpha\alphaα
velocity β = tanh α β = tanh α beta=tanh alpha\beta=\tanh \alphaβ=tanhα
1 1 β 2 = cosh α , β 1 β 2 = sinh α 1 1 β 2 = cosh α , β 1 β 2 = sinh α (1)/(sqrt(1-beta^(2)))=cosh alpha,quad(beta)/(sqrt(1-beta^(2)))=sinh alpha\frac{1}{\sqrt{1-\beta^{2}}}=\cosh \alpha, \quad \frac{\beta}{\sqrt{1-\beta^{2}}}=\sinh \alpha11β2=coshα,β1β2=sinhα
(a)
{ t ¯ = t cosh α x sinh α , x ¯ = t sinh α + x cosh α , y ¯ = y z ¯ = z t ¯ = t cosh α x sinh α , x ¯ = t sinh α + x cosh α , y ¯ = y z ¯ = z {[ bar(t)=t cosh alpha-x sinh alpha","],[ bar(x)=-t sinh alpha+x cosh alpha","],[ bar(y)=y quad bar(z)=z]:}\left\{\begin{array}{l} \bar{t}=t \cosh \alpha-x \sinh \alpha, \\ \bar{x}=-t \sinh \alpha+x \cosh \alpha, \\ \bar{y}=y \quad \bar{z}=z \end{array}\right.{t¯=tcoshαxsinhα,x¯=tsinhα+xcoshα,y¯=yz¯=z
(b)
r ¯ = [ ( x cosh α t sinh α ) 2 + y 2 + z 2 ] 1 / 2 r ¯ = ( x cosh α t sinh α ) 2 + y 2 + z 2 1 / 2 bar(r)=[(x cosh alpha-t sinh alpha)^(2)+y^(2)+z^(2)]^(1//2)\bar{r}=\left[(x \cosh \alpha-t \sinh \alpha)^{2}+y^{2}+z^{2}\right]^{1 / 2}r¯=[(xcoshαtsinhα)2+y2+z2]1/2
(c) F = ( e / r ¯ 3 ) [ ( x cosh α t sinh α ) d y d z + y d z F = e / r ¯ 3 [ ( x cosh α t sinh α ) d y d z + y d z ^(**)F=(e// bar(r)^(3))[(x cosh alpha-t sinh alpha)dy^^dz+ydz^^{ }^{*} \boldsymbol{F}=\left(e / \bar{r}^{3}\right)[(x \cosh \alpha-t \sinh \alpha) \boldsymbol{d} y \wedge \boldsymbol{d} z+y \boldsymbol{d} z \wedgeF=(e/r¯3)[(xcoshαtsinhα)dydz+ydz
(4.19) ( cosh α d x sinh α d t ) + z ( cosh α d x sinh α d t ) d y ] (4.19) ( cosh α d x sinh α d t ) + z ( cosh α d x sinh α d t ) d y ] {:(4.19)(cosh alpha dx-sinh alpha dt)+z(cosh alpha dx-sinh alpha dt)^^dy]:}\begin{equation*} (\cosh \alpha \boldsymbol{d} x-\sinh \alpha \boldsymbol{d} t)+z(\cosh \alpha \boldsymbol{d} x-\sinh \alpha \boldsymbol{d} t) \wedge \boldsymbol{d} y] \tag{4.19} \end{equation*}(4.19)(coshαdxsinhαdt)+z(coshαdxsinhαdt)dy]
(d) compare with the general dual 2-form,
F = E x d y d z + E y d z d x + E z d x d y + B x d t d x + B y d t d y + B z d t d z F = E x d y d z + E y d z d x + E z d x d y + B x d t d x + B y d t d y + B z d t d z {:[**F=E_(x)dy^^dz+E_(y)dz^^dx+E_(z)dx^^dy],[+B_(x)dt^^dx+B_(y)dt^^dy+B_(z)dt^^dz]:}\begin{aligned} * \boldsymbol{F}= & E_{x} \boldsymbol{d} y \wedge \boldsymbol{d} z+E_{y} \boldsymbol{d} z \wedge \boldsymbol{d} x+E_{z} \boldsymbol{d} x \wedge \boldsymbol{d} y \\ & +B_{x} \boldsymbol{d} t \wedge \boldsymbol{d} x+B_{y} \boldsymbol{d} t \wedge \boldsymbol{d} y+B_{z} \boldsymbol{d} t \wedge \boldsymbol{d} z \end{aligned}F=Exdydz+Eydzdx+Ezdxdy+Bxdtdx+Bydtdy+Bzdtdz
and get the desired individual field components
(e) { E x = ( e / r ¯ 3 ) ( x cosh α t sinh α ) , B x = 0 , E y = ( e / / 3 ) y cosh α , B y = ( e / r ¯ 3 ) z sinh α , E z = ( e / r 3 ) z cosh α , B z = ( e / r 3 ) y sinh α . E x = e / r ¯ 3 ( x cosh α t sinh α ) ,      B x = 0 , E y = e / / 3 y cosh α ,      B y = e / r ¯ 3 z sinh α , E z = e / r 3 z cosh α ,      B z = e / r 3 y sinh α . {[E_(x)=(e// bar(r)^(3))(x cosh alpha-t sinh alpha)",",B_(x)=0","],[E_(y)=(e////^(3))y cosh alpha",",B_(y)=-(e// bar(r)^(3))z sinh alpha","],[E_(z)=(e//r^(3))z cosh alpha",",B_(z)=(e//r^(3))y sinh alpha.]:}\begin{cases}E_{x}=\left(e / \bar{r}^{3}\right)(x \cosh \alpha-t \sinh \alpha), & B_{x}=0, \\ E_{y}=\left(e / /^{3}\right) y \cosh \alpha, & B_{y}=-\left(e / \bar{r}^{3}\right) z \sinh \alpha, \\ E_{z}=\left(e / r^{3}\right) z \cosh \alpha, & B_{z}=\left(e / r^{3}\right) y \sinh \alpha .\end{cases}{Ex=(e/r¯3)(xcoshαtsinhα),Bx=0,Ey=(e//3)ycoshα,By=(e/r¯3)zsinhα,Ez=(e/r3)zcoshα,Bz=(e/r3)ysinhα.
One can verify that the invariants
(4.21) B 2 E 2 = 1 2 F α β F α β , (4.22) E B = 1 4 F α β F α β (4.21) B 2 E 2 = 1 2 F α β F α β , (4.22) E B = 1 4 F α β F α β {:[(4.21)B^(2)-E^(2)=(1)/(2)F_(alpha beta)F^(alpha beta)","],[(4.22)E*B=(1)/(4)F_(alpha beta)^(**)F^(alpha beta)]:}\begin{align*} & \boldsymbol{B}^{2}-\boldsymbol{E}^{2}=\frac{1}{2} F_{\alpha \beta} F^{\alpha \beta}, \tag{4.21}\\ & \boldsymbol{E} \cdot \boldsymbol{B}=\frac{1}{4} F_{\alpha \beta}{ }^{*} F^{\alpha \beta} \tag{4.22} \end{align*}(4.21)B2E2=12FαβFαβ,(4.22)EB=14FαβFαβ
have the same value in the laboratory frame as in the rocket frame, as required. Note that the honeycomb structure of the differential form is not changed when one goes from the rocket frame to the laboratory frame. What changes is only the mathematical formula that describes it.

§4.4. RADIATION FIELDS

The Maxwell structure of tubes associated with a charge in uniform motion is more remarkable than it may seem at first sight, and not only because of the Lorentz contraction of the tubes in the direction of motion. The tubes arbitrarily far away move on in military step with the charge on which they center, despite the fact that there is no time for information "emitted" from the charge "right now" to get to the faraway tube "right now." The structure of the faraway tubes "right now" must therefore derive from the charge at an earlier moment on its uniform-motion, straight-line trajectory. This circumstance shows up nowhere more clearly than in what happens to the field in consequence of a sudden change, in a short time Δ τ Δ τ Delta tau\Delta \tauΔτ, from one uniform velocity to another uniform velocity (Figure 4.6). The tubes have the standard patterns for the two states of motion, one pattern within a sphere of radius r r rrr, the other outside that sphere, where r r rrr is equal to the lapse of time (" cm of light-travel time") since the acceleration took place. The necessity for the two patterns to fit together in the intervening zone, of thickness Δ r = Δ τ Δ r = Δ τ Delta r=Delta tau\Delta r=\Delta \tauΔr=Δτ, forces the field there to be multiplied up by a "stretching factor," proportional to r r rrr. This factor is responsible for the well-known fact that radiative forces fall off inversely only as the first power of the distance (Figure 4.6).
When the charge continuously changes its state of motion, the structure of the electromagnetic field, though based on the same simple principles as those illustrated in Figure 4.6, nevertheless looks more complex. The following is the Faraday 2-form
Figure 4.6.
Mechanism of radiation. J. J. Thomson's way to understand why the strength of an electromagnetic wave falls only as the inverse first power of distance r r rrr and why the amplitude of the wave varies (for low velocities) as sin θ sin θ sin theta\sin \thetasinθ (maximum in the plane perpendicular to the line of acceleration). The charge was moving to the left at uniform velocity. Far away from it, the lines of force continue to move as if this uniform velocity were going to continue forever (Coulomb field of point-charge in slow motion). However, closer up the field is that of a point-change moving to the right with uniform velocity ( 1 / r 2 1 / r 2 1//r^(2)1 / r^{2}1/r2 dependence of strength upon distance). The change from the one field pattern to another is confined to a shell of thickness Δ τ Δ τ Delta tau\Delta \tauΔτ located at a distance r r rrr from the point of acceleration (amplification of field by "stretching factor" r sin θ Δ β / Δ τ r sin θ Δ β / Δ τ r sin theta Delta beta//Delta taur \sin \theta \Delta \beta / \Delta \taursinθΔβ/Δτ; see text). We thank C C CCC. Teitelboim for the construction of this diagram.
for the field of an electric dipole of magnitude p 1 p 1 p_(1)p_{1}p1 oscillating up and down parallel
Field of an oscillating dipole to the z z zzz-axis:
F = E x d x d t + + B x d y d z + = real part of { p 1 e i ω r i ω t [ 2 cos θ ( 1 r 3 i ω r 2 ) gives E r d r d t + sin θ ( 1 r 3 i ω r 2 ω 2 r ) gives E θ r d θ d t sin θ ( i ω r 2 ω 2 r ) gives B ϕ d r r d θ ] } F = E x d x d t + + B x d y d z + =  real part of  p 1 e i ω r i ω t 2 cos θ 1 r 3 i ω r 2 gives  E r d r d t + sin θ 1 r 3 i ω r 2 ω 2 r gives  E θ r d θ d t sin θ i ω r 2 ω 2 r gives  B ϕ d r r d θ ] } {:[F=E_(x)dx^^dt+cdots+B_(x)dy^^dz+cdots=" real part of "{p_(1)e^(i omega r-i omega t):}],[ubrace([2cos theta((1)/(r^(3))-(i omega)/(r^(2)))ubrace)_("gives "E_(r))dr^^dt+ubrace(sin theta((1)/(r^(3))-(i omega)/(r^(2))-(omega^(2))/(r))ubrace)_("gives "E_(theta))rd theta^^dt],[ubrace(sin theta((-i omega)/(r^(2))-(omega^(2))/(r))ubrace)_("gives "B_(phi))dr^^rd theta]}]:}\begin{align*} \boldsymbol{F}= & E_{x} \boldsymbol{d} x \wedge \boldsymbol{d} t+\cdots+B_{x} \boldsymbol{d} y \wedge \boldsymbol{d} z+\cdots=\text { real part of }\left\{p_{1} e^{i \omega r-i \omega t}\right. \\ & \underbrace{\left[2 \cos \theta\left(\frac{1}{r^{3}}-\frac{i \omega}{r^{2}}\right)\right.}_{\text {gives } E_{r}} \boldsymbol{d} r \wedge \boldsymbol{d} t+\underbrace{\sin \theta\left(\frac{1}{r^{3}}-\frac{i \omega}{r^{2}}-\frac{\omega^{2}}{r}\right)}_{\text {gives } E_{\theta}} r \boldsymbol{d} \theta \wedge \boldsymbol{d} t \\ & \underbrace{\sin \theta\left(\frac{-i \omega}{r^{2}}-\frac{\omega^{2}}{r}\right)}_{\text {gives } B_{\phi}} \boldsymbol{d} r \wedge r \boldsymbol{d} \theta]\} \end{align*}F=Exdxdt++Bxdydz+= real part of {p1eiωriωt[2cosθ(1r3iωr2)gives Erdrdt+sinθ(1r3iωr2ω2r)gives Eθrdθdtsinθ(iωr2ω2r)gives Bϕdrrdθ]}
and the dual 2 -form Maxwell = F = F =^(**)F={ }^{*} \boldsymbol{F}=F is
F = B x d x d t + E x d y d z + = real part of { p 1 e i ω r i ω t gives B ϕ sin θ ( i ω r 2 ω 2 r ) gives E r d t r sin θ d ϕ 2 cos θ ( 1 r 3 i ω r 2 ) gives E θ r d θ r sin θ d ϕ sin sin θ d ϕ d r ] } . r 3 θ ( 1 r 2 i ω r ) F = B x d x d t + E x d y d z + =  real part of  p 1 e i ω r i ω t gives  B ϕ sin θ i ω r 2 ω 2 r gives  E r d t r sin θ d ϕ 2 cos θ 1 r 3 i ω r 2 gives  E θ r d θ r sin θ d ϕ sin sin θ d ϕ d r ] } . r 3 θ 1 r 2 i ω r {:[**F=-ubrace(-B_(x)dx^^dt-cdots+E_(x)dy^^dz+cdots=" real part of "{p_(1)e^(i omega r-i omega t)ubrace)_("gives "B_(phi))],[ubrace(sin theta((-i omega)/(r^(2))-(omega^(2))/(r))ubrace)_("gives "E_(r))dt^^r sin theta d phi],[ubrace(2cos theta((1)/(r^(3))-(i omega)/(r^(2)))ubrace)_("gives "E_(theta))rd theta^^r sin theta d phi],[ubrace(sin sin theta d phi^^dr]}.ubrace)_(r^(3)theta((1)/(r^(2))-(i omega)/(r)))]:}\begin{align*} * \boldsymbol{F}= & -\underbrace{-B_{x} \boldsymbol{d} x \wedge \boldsymbol{d} t-\cdots+E_{x} \boldsymbol{d} y \wedge \boldsymbol{d} z+\cdots=\text { real part of }\left\{p_{1} e^{i \omega r-i \omega t}\right.}_{\text {gives } B_{\phi}} \\ & \underbrace{\sin \theta\left(\frac{-i \omega}{r^{2}}-\frac{\omega^{2}}{r}\right)}_{\text {gives } E_{r}} \boldsymbol{d} t \wedge r \sin \theta \boldsymbol{d} \phi \\ & \underbrace{2 \cos \theta\left(\frac{1}{r^{3}}-\frac{i \omega}{r^{2}}\right)}_{\text {gives } E_{\theta}} r \boldsymbol{d} \theta \wedge r \sin \theta \boldsymbol{d} \phi \\ & \underbrace{\sin \sin \theta \boldsymbol{d} \phi \wedge \boldsymbol{d} r]\} .}_{r^{3} \theta\left(\frac{1}{r^{2}}-\frac{i \omega}{r}\right)} \end{align*}F=Bxdxdt+Exdydz+= real part of {p1eiωriωtgives Bϕsinθ(iωr2ω2r)gives Erdtrsinθdϕ2cosθ(1r3iωr2)gives Eθrdθrsinθdϕsinsinθdϕdr]}.r3θ(1r2iωr)

§4.5. MAXWELL'S EQUATIONS

The general 2 -form F F F\boldsymbol{F}F is written as a superposition of wedge products with a factor 1 2 1 2 (1)/(2)\frac{1}{2}12,
(4.25) F = 1 2 F μ ν d x μ d x ν (4.25) F = 1 2 F μ ν d x μ d x ν {:(4.25)F=(1)/(2)F_(mu nu)dx^(mu)^^dx^(nu):}\begin{equation*} \boldsymbol{F}=\frac{1}{2} F_{\mu \nu} \boldsymbol{d} x^{\mu} \wedge \boldsymbol{d} x^{\nu} \tag{4.25} \end{equation*}(4.25)F=12Fμνdxμdxν
because the typical term appears twice, once as F x y d x d y F x y d x d y F_(xy)dx^^dyF_{x y} \boldsymbol{d} x \wedge \boldsymbol{d} yFxydxdy and the second time as F y x d y d x F y x d y d x F_(yx)dy^^dxF_{y x} \boldsymbol{d} y \wedge \boldsymbol{d} xFyxdydx, with F y x = F x y F y x = F x y F_(yx)=-F_(xy)F_{y x}=-F_{x y}Fyx=Fxy and d y d x = d x d y d y d x = d x d y dy^^dx=-dx^^dy\boldsymbol{d} y \wedge \boldsymbol{d} x=-\boldsymbol{d} x \wedge \boldsymbol{d} ydydx=dxdy.
If differentiation ("taking the gradient"; the operator d d d\boldsymbol{d}d ) produced out of a scalar a 1-form, it is also true that differentiation (again the operator d d d\boldsymbol{d}d, but now generally known under Cartan's name of "exterior differentiation") produces a 2 -form out of the general 1-form; and applied to a 2 -form produces a 3 -form; and applied to a 3 -form produces a 4 -form, the form of the highest order that spacetime will accommodate. Write the general f f fff-form as
(4.26) ϕ = 1 f ! ϕ α 1 α 2 α f d x α 1 d x α 2 d x α f (4.26) ϕ = 1 f ! ϕ α 1 α 2 α f d x α 1 d x α 2 d x α f {:(4.26)phi=(1)/(f!)phi_(alpha_(1)alpha_(2)dotsalpha_(f))dx^(alpha_(1))^^dx^(alpha_(2))^^cdots^^dx^(alpha_(f)):}\begin{equation*} \boldsymbol{\phi}=\frac{1}{f!} \phi_{\alpha_{1} \alpha_{2} \ldots \alpha_{f}} \boldsymbol{d} x^{\alpha_{1}} \wedge \boldsymbol{d} x^{\alpha_{2}} \wedge \cdots \wedge \boldsymbol{d} x^{\alpha_{f}} \tag{4.26} \end{equation*}(4.26)ϕ=1f!ϕα1α2αfdxα1dxα2dxαf
where the coefficient ϕ α 1 α 2 α f ϕ α 1 α 2 α f phi_(alpha_(1)alpha_(2)dotsalpha_(f))\phi_{\alpha_{1} \alpha_{2} \ldots \alpha_{f}}ϕα1α2αf, like the wedge product that follows it, is antisym-
Taking exterior derivative metric under interchange of any two indices. Then the exterior derivative of ϕ ϕ phi\boldsymbol{\phi}ϕ is
(4.27) d ϕ 1 f ! ϕ α 1 α 2 α f x α 0 d x α 0 d x α 1 d x α 2 d x α ϝ (4.27) d ϕ 1 f ! ϕ α 1 α 2 α f x α 0 d x α 0 d x α 1 d x α 2 d x α ϝ {:(4.27)d phi-=(1)/(f!)(delphi_(alpha_(1)alpha_(2)dotsalpha_(f)))/(delx^(alpha_(0)))dx^(alpha_(0))^^dx^(alpha_(1))^^dx^(alpha_(2))^^cdots^^dx^(alpha_(ϝ)):}\begin{equation*} \boldsymbol{d} \boldsymbol{\phi} \equiv \frac{1}{f!} \frac{\partial \phi_{\alpha_{1} \alpha_{2} \ldots \alpha_{f}}}{\partial x^{\alpha_{0}}} \boldsymbol{d} x^{\alpha_{0}} \wedge \boldsymbol{d} x^{\alpha_{1}} \wedge \boldsymbol{d} x^{\alpha_{2}} \wedge \cdots \wedge \boldsymbol{d} x^{\alpha_{\digamma}} \tag{4.27} \end{equation*}(4.27)dϕ1f!ϕα1α2αfxα0dxα0dxα1dxα2dxαϝ
Take the exterior derivative of Faraday according to this rule and find that it vanishes, not only for the special case of the dipole oscillator, but also for a general electromagnetic field. Thus, in the coordinates appropriate for a local Lorentz frame, one has
d F = d ( E x d x d t + + B x d y d z + ) = ( E x t d t + E x x d x + E x y d y + E x z d z ) d x d t (4.28) + ( 5 more such sets of 4 terms each ) d F = d E x d x d t + + B x d y d z + = E x t d t + E x x d x + E x y d y + E x z d z d x d t (4.28) + ( 5  more such sets of  4  terms each  ) {:[dF=d(E_(x)dx^^dt+cdots+B_(x)dy^^dz+cdots)],[=((delE_(x))/(del t)dt+(delE_(x))/(del x)dx+(delE_(x))/(del y)dy+(delE_(x))/(del z)dz)^^dx^^dt],[(4.28)+cdots(5" more such sets of "4" terms each ")cdots]:}\begin{align*} \boldsymbol{d} \boldsymbol{F}= & \boldsymbol{d}\left(E_{x} \boldsymbol{d} x \wedge \boldsymbol{d} t+\cdots+B_{x} \boldsymbol{d} y \wedge \boldsymbol{d} z+\cdots\right) \\ = & \left(\frac{\partial E_{x}}{\partial t} \boldsymbol{d} t+\frac{\partial E_{x}}{\partial x} \boldsymbol{d} x+\frac{\partial E_{x}}{\partial y} \boldsymbol{d} y+\frac{\partial E_{x}}{\partial z} \boldsymbol{d} z\right) \wedge \boldsymbol{d} x \wedge \boldsymbol{d} t \\ & +\cdots(5 \text { more such sets of } 4 \text { terms each }) \cdots \tag{4.28} \end{align*}dF=d(Exdxdt++Bxdydz+)=(Extdt+Exxdx+Exydy+Exzdz)dxdt(4.28)+(5 more such sets of 4 terms each )
Note that such a term as d y d y d z d y d y d z dy^^dy^^dz\boldsymbol{d} y \wedge \boldsymbol{d} y \wedge \boldsymbol{d} zdydydz is automatically zero ("collapse of egg-crate cell when stamped on"). Collect the terms that do not vanish and find
d F = ( B x x + B y y + B z z ) d x d y d z + ( B x t + E z y E y z ) d t d y d z + ( B y t + E x z E z x ) d t d z d x (4.29) + ( B z t + E y x E x y ) d t d x d y . d F = B x x + B y y + B z z d x d y d z + B x t + E z y E y z d t d y d z + B y t + E x z E z x d t d z d x (4.29) + B z t + E y x E x y d t d x d y . {:[dF=((delB_(x))/(del x)+(delB_(y))/(del y)+(delB_(z))/(del z))dx^^dy^^dz],[+((delB_(x))/(del t)+(delE_(z))/(del y)-(delE_(y))/(del z))dt^^dy^^dz],[+((delB_(y))/(del t)+(delE_(x))/(del z)-(delE_(z))/(del x))dt^^dz^^dx],[(4.29)+((delB_(z))/(del t)+(delE_(y))/(del x)-(delE_(x))/(del y))dt^^dx^^dy.]:}\begin{align*} \boldsymbol{d} \boldsymbol{F}= & \left(\frac{\partial B_{x}}{\partial x}+\frac{\partial B_{y}}{\partial y}+\frac{\partial B_{z}}{\partial z}\right) \boldsymbol{d} x \wedge \boldsymbol{d} y \wedge \boldsymbol{d} z \\ & +\left(\frac{\partial B_{x}}{\partial t}+\frac{\partial E_{z}}{\partial y}-\frac{\partial E_{y}}{\partial z}\right) \boldsymbol{d} t \wedge \boldsymbol{d} y \wedge \boldsymbol{d} z \\ & +\left(\frac{\partial B_{y}}{\partial t}+\frac{\partial E_{x}}{\partial z}-\frac{\partial E_{z}}{\partial x}\right) \boldsymbol{d} t \wedge \boldsymbol{d} z \wedge \boldsymbol{d} x \\ & +\left(\frac{\partial B_{z}}{\partial t}+\frac{\partial E_{y}}{\partial x}-\frac{\partial E_{x}}{\partial y}\right) \boldsymbol{d} t \wedge \boldsymbol{d} x \wedge \boldsymbol{d} y . \tag{4.29} \end{align*}dF=(Bxx+Byy+Bzz)dxdydz+(Bxt+EzyEyz)dtdydz+(Byt+ExzEzx)dtdzdx(4.29)+(Bzt+EyxExy)dtdxdy.
Each term in this expression is familiar from Maxwell's equations
div B = B = 0 div B = B = 0 div B=grad*B=0\operatorname{div} \boldsymbol{B}=\boldsymbol{\nabla} \cdot \boldsymbol{B}=0divB=B=0
and
curl E = × E = B ˙ curl E = × E = B ˙ curl E=grad xx E=-B^(˙)\operatorname{curl} E=\nabla \times E=-\dot{\boldsymbol{B}}curlE=×E=B˙
Each vanishes, and with their vanishing Faraday itself is seen to have zero exterior derivative:
(4.30) d F = 0 . (4.30) d F = 0 . {:(4.30)dF=0.:}\begin{equation*} \boldsymbol{d} \boldsymbol{F}=0 . \tag{4.30} \end{equation*}(4.30)dF=0.
In other words, "Faraday is a closed 2-form"; "the tubes of F F F\boldsymbol{F}F nowhere come to an end."
A similar calculation gives for the exterior derivative of the dual 2-form Maxwell the result
d F = d ( B x d x d t + E x d y d z + ) = ( E x x + E y y + E z z ) d x d y d z + ( E x t B z y + B y z ) d t d y d z + = 4 π ( ρ d x d y d z J x d t d y d z J y d t d z d x J z d t d x d y ) = 4 π J (4.31) d F = 4 π J . d F = d B x d x d t + E x d y d z + = E x x + E y y + E z z d x d y d z + E x t B z y + B y z d t d y d z + = 4 π ( ρ d x d y d z J x d t d y d z J y d t d z d x J z d t d x d y = 4 π J (4.31) d F = 4 π J . {:[d^(**)F=d(-B_(x)dx^^dt-cdots+E_(x)dy^^dz+cdots)],[=((delE_(x))/(del x)+(delE_(y))/(del y)+(delE_(z))/(del z))dx^^dy^^dz],[+((delE_(x))/(del t)-(delB_(z))/(del y)+(delB_(y))/(del z))dt^^dy^^dz],[+cdots],[=4pi(rho dx^^dy^^dz],[-J_(x)dt^^dy^^dz],[-J_(y)dt^^dz^^dx],[{:-J_(z)dt^^dx^^dy)=4pi^(**)J],[(4.31)d^(**)F=4pi**J.]:}\begin{align*} \boldsymbol{d}^{*} \boldsymbol{F}= & \boldsymbol{d}\left(-B_{x} \boldsymbol{d} x \wedge \boldsymbol{d} t-\cdots+E_{x} \boldsymbol{d} y \wedge \boldsymbol{d} z+\cdots\right) \\ = & \left(\frac{\partial E_{x}}{\partial x}+\frac{\partial E_{y}}{\partial y}+\frac{\partial E_{z}}{\partial z}\right) \boldsymbol{d} x \wedge \boldsymbol{d} y \wedge \boldsymbol{d} z \\ & +\left(\frac{\partial E_{x}}{\partial t}-\frac{\partial B_{z}}{\partial y}+\frac{\partial B_{y}}{\partial z}\right) \boldsymbol{d} t \wedge \boldsymbol{d} y \wedge \boldsymbol{d} z \\ & +\cdots \\ = & 4 \pi(\rho \boldsymbol{d} x \wedge \boldsymbol{d} y \wedge \boldsymbol{d} z \\ & -J_{x} \boldsymbol{d} t \wedge \boldsymbol{d} y \wedge \boldsymbol{d} z \\ & -J_{y} \boldsymbol{d} t \wedge \boldsymbol{d} z \wedge \boldsymbol{d} x \\ & \left.-J_{z} \boldsymbol{d} t \wedge \boldsymbol{d} x \wedge \boldsymbol{d} y\right)=4 \pi^{*} \boldsymbol{J} \\ \boldsymbol{d}^{*} \boldsymbol{F}= & 4 \pi * \boldsymbol{J} . \tag{4.31} \end{align*}dF=d(Bxdxdt+Exdydz+)=(Exx+Eyy+Ezz)dxdydz+(ExtBzy+Byz)dtdydz+=4π(ρdxdydzJxdtdydzJydtdzdxJzdtdxdy)=4πJ(4.31)dF=4πJ.
Faraday structure: tubes nowhere end
Maxwell structure: density of tube endings given by charge-current 3 -form
In empty space this exterior derivative, too, vanishes; there Maxwell is a closed 2 -form; the tubes of F F ^(**)F{ }^{*} \boldsymbol{F}F, like the tubes of F F F\boldsymbol{F}F, nowhere come to an end.
In a region where charge is present, the situation changes. Tubes of Maxwell take their origin in such a region. The density of endings is described by the 3 -form *J = charge, a "collection of eggcrate cells" collected along bundles of world lines.
The two equations
d F = 0 d F = 0 dF=0\boldsymbol{d} \boldsymbol{F}=0dF=0
and
d F = 4 π J d F = 4 π J d^(**)F=4pi^(**)J\boldsymbol{d}^{*} \boldsymbol{F}=4 \pi^{*} \boldsymbol{J}dF=4πJ
summarize the entire content of Maxwell's equations in geometric language. The forms F = F = F=\boldsymbol{F}=F= Faraday, and F = M a x w e l l F = M a x w e l l ^(**)F=Maxwell{ }^{*} \boldsymbol{F}=\boldsymbol{M a x w e l l}F=Maxwell, can be described in any coordinates one pleases-or in a language (honeycomb and egg-crate structures) free of any reference whatsoever to coordinates. Remarkably, neither equation makes any reference whatsoever to metric. As Hermann Weyl was one of the most emphatic in stressing (see also Chapters 8 and 9), the concepts of form and exterior derivative are metricfree. Metric made an appearance only in one place, in the concept of duality ("perpendicularity") that carried attention from F F F\boldsymbol{F}F to the dual structure F F ^(**)F{ }^{*} \boldsymbol{F}F.

§4.6. EXTERIOR DERIVATIVE AND CLOSED FORMS

The words "honeycomb" and "egg crate" may have given some feeling for the geometry that goes with electrodynamics. Now to spell out these concepts more clearly and illustrate in geometric terms, with electrodynamics as subject matter, what it means to speak of "exterior differentiation." Marching around a boundary, yes; but how and why and with what consequences? It is helpful to return to functions and 1 -forms, and see them and the 2 -forms Faraday and Maxwell and the 3 -form charge as part of an ordered progression (see Box 4.4). Two-forms are seen in this box to be of two kinds: (1) a special 2 -form, known as a "closed" 2 -form, which has the property that as many tubes enter a closed 2 -surface as emerge from it (exterior derivative of 2-form zero; no 3-form derivable from it other than the trivial zero 3 -form!); and (2) a general 2 -form, which sends across a closed 2 -surface a non-zero net number of tubes, and therefore permits one to define a nontrivial 3-form ("exterior derivative of the 2 -form"), which has precisely as many egg-crate cells in any closed 2 -surface as the net number of tubes of the 2 -form emerging from that same closed 2 -surface (generalization of Faraday's concept of tubes of force to the world of spacetime, curved as well as flat).

Box 4.4 THE PROGRESSION OF FORMS AND EXTERIOR DERIVATIVES

O-Form or Scalar, f f fff
An example in the context of 3 -space and Newtonian physics is temperature, T ( x , y , z ) T ( x , y , z ) T(x,y,z)T(x, y, z)T(x,y,z), and in the context of spacetime, a scalar potential, ϕ ( t , x , y , z ) ϕ ( t , x , y , z ) phi(t,x,y,z)\phi(t, x, y, z)ϕ(t,x,y,z).

From Scalar to 1-Form

Take the gradient or "exterior derivative" of a scalar f f fff to obtain a special 1-form, γ = d f γ = d f gamma=df\boldsymbol{\gamma}=\boldsymbol{d} fγ=df. Comments: (a) Any additive constant included in f f fff is erased in the process of differentiation; the quantity n n nnn in the diagram at the left is unknown and irrelevant. (b) The 1 -form γ γ gamma\boldsymbol{\gamma}γ is special in the sense that surfaces in one region "mesh" with surfaces in a neighboring region ("closed 1 -form"). (c) Line integral d S / j d f d S / j d f int_(d)^(S//j)df\int_{d}^{S / j} \boldsymbol{d} fdS/jdf is independent of path for any class of paths equivalent to one another under continuous deformation. (d) The 1 -form is a machine to produce a number ("bongs of bell" as each successive integral surface is crossed) out of a displacement (approximation to concept of a tangent vector).

General 1-Form β = β α d x α β = β α d x α beta=beta_(alpha)dx^(alpha)\boldsymbol{\beta}=\beta_{\alpha} \boldsymbol{d} x^{\alpha}β=βαdxα

This is a pattern of surfaces, as illustrated in the diagram at the right; i.e., a machine to produce a number ("bongs of bell"; β , u β , u (:beta,u:)\langle\boldsymbol{\beta}, \boldsymbol{u}\rangleβ,u ) out of a vector. A 1-form has a reality and position in space independent of all choice of coordinates. Surfaces do not ordinarily mesh. Integral β β int beta\int \boldsymbol{\beta}β around indicated closed loop does not give zero ("more bongs than antibongs").
From 1-Form to 2-Form ξ = d β = β α x μ d x μ d x α ξ = d β = β α x μ d x μ d x α xi=d beta=(delbeta_(alpha))/(delx^(mu))dx^(mu)^^dx^(alpha)\boldsymbol{\xi}=\boldsymbol{d} \boldsymbol{\beta}=\frac{\partial \beta_{\alpha}}{\partial x^{\mu}} \boldsymbol{d} x^{\mu} \wedge \boldsymbol{d} x^{\alpha}ξ=dβ=βαxμdxμdxα
ξ ξ xi\xiξ is a pattern of honeycomb-like cells, with a direction of circulation marked on each, so stationed
Box 4.4 (continued)
that the number of cells encompassed in the dotted closed path is identical to the net contribution (excess of bongs over antibongs) for the same path in the diagram of β β beta\boldsymbol{\beta}β above. The "exterior derivative" is defined so this shall be so; the generalized Stokes theorem codifies it. The word "exterior" comes from the fact that the path goes around the periphery of the region under analysis. Thus the 2 -form is a machine to get a number (number of tubes, ξ , u v ξ , u v (:xi,u^^v:)\langle\xi, \boldsymbol{u} \wedge \boldsymbol{v}\rangleξ,uv ) out of a bit of surface ( u v ) ( u v ) (u^^v)(\boldsymbol{u} \wedge \boldsymbol{v})(uv) that has a sense of circulation indicated upon it. The 2 -form thus defined is special in this sense: a rubber sheet "supported around its edges" by
the dotted curve or any other closed curve is crossed by the same number of tubes when: (a) it bulges up in the middle; (b) it is pushed down in the middle; (c) it experiences any other continuous deformation. The Faraday or 2 -form F F F\boldsymbol{F}F of electromagnetism, always expressible as F = d A F = d A F=dA\boldsymbol{F}=\boldsymbol{d} \boldsymbol{A}F=dA ( A = 4 A = 4 A=4\boldsymbol{A}=4A=4-potential, a 1 -form), also has always this special property ("conservation of tubes").

O-Form to 1 1 1\mathbf{1}1-Form to 2 -Form? No!

Go from scalar f f fff to 1 -form γ = d f γ = d f gamma=df\boldsymbol{\gamma}=\boldsymbol{d} fγ=df. The next step to a 2 -form α α alpha\boldsymbol{\alpha}α is vacuous. The net contribution of the line integral y y int y\int yy around the dotted closed path is automatically zero. To reproduce that zero result requires a zero 2 -form. Thus α = d γ = d d f α = d γ = d d f alpha=d gamma=ddf\boldsymbol{\alpha}=\boldsymbol{d} \boldsymbol{\gamma}=\boldsymbol{d} \boldsymbol{d} fα=dγ=ddf has to be the zero 2 -form. This result is a special instance of the general result d d = 0 d d = 0 dd=0\boldsymbol{d d}=0dd=0.
General 2-Form σ = 1 2 σ α β d x α d x β σ = 1 2 σ α β d x α d x β sigma=(1)/(2)sigma_(alpha beta)dx^(alpha)^^dx^(beta)\boldsymbol{\sigma}=\frac{1}{2} \sigma_{\alpha \beta} \boldsymbol{d} x^{\alpha} \wedge \boldsymbol{d} x^{\beta}σ=12σαβdxαdxβ, with σ α β = σ β α σ α β = σ β α sigma_(alpha beta)=-sigma_(beta alpha)\sigma_{\alpha \beta}=-\sigma_{\beta \alpha}σαβ=σβα
Again, this is a honeycomb-like structure, and again a machine to get a number (number of tubes, σ , u v σ , u v (:sigma,u^^v:)\langle\boldsymbol{\sigma}, \boldsymbol{u} \wedge \boldsymbol{v}\rangleσ,uv ) out of a surface ( u v u v u^^v\boldsymbol{u} \wedge \boldsymbol{v}uv ) that has a sense of circulation indicated on it. It is general in the sense that the honeycomb structures in one region do not ordinarily mesh with those
in a neighboring region. In consequence, a closed 2 -surface, such as the box-like surface indicated by dotted lines at the right, is ordinarily crossed by a non-zero net number of tubes. The net number of tubes emerging from such a closed surface is, however, exactly zero when the 2 -form is the exterior derivative of a 1 -form.
From 2-Form to 3-Form μ = d σ = σ | α β | x γ d x γ d x α d x β μ = d σ = σ | α β | x γ d x γ d x α d x β mu=d sigma=(delsigma_(|alpha beta|))/(delx^(gamma))dx^(gamma)^^dx^(alpha)^^dx^(beta)\boldsymbol{\mu}=\boldsymbol{d} \boldsymbol{\sigma}=\frac{\partial \sigma_{|\alpha \beta|}}{\partial x^{\gamma}} \boldsymbol{d} x^{\gamma} \wedge \boldsymbol{d} x^{\alpha} \wedge \boldsymbol{d} x^{\beta}μ=dσ=σ|αβ|xγdxγdxαdxβ,
where d x γ d x α d x β 3 ! d x [ γ d x α d x β ] d x γ d x α d x β 3 ! d x [ γ d x α d x β ] dx^(gamma)^^dx^(alpha)^^dx^(beta)-=3!dx^([gamma)ox dx^(alpha)ox dx^(beta])\boldsymbol{d} x^{\gamma} \wedge \boldsymbol{d} x^{\alpha} \wedge \boldsymbol{d} x^{\beta} \equiv 3!\boldsymbol{d} x^{[\gamma} \otimes \boldsymbol{d} x^{\alpha} \otimes \boldsymbol{d} x^{\beta]}dxγdxαdxβ3!dx[γdxαdxβ]
This egg-crate type of structure is a machine to get a number (number of cells μ , u v w μ , u v w (:mu,u^^v^^w:)\langle\boldsymbol{\mu}, \boldsymbol{u} \wedge \boldsymbol{v} \wedge \boldsymbol{w}\rangleμ,uvw ) from a volume (volume u v w u v w u^^v^^w\boldsymbol{u} \wedge \boldsymbol{v} \wedge \boldsymbol{w}uvw within which one counts the cells). A more complete diagram would provide each cell and the volume of integration itself with an indicator of orientation (analogous to the arrow of circulation shown for cells of the 2 -form). The contribution of a given cell to the count of cells is +1 or -1 , according as the orientation indicators have same sense or opposite sense. The number of egg-crate cells of μ = d σ μ = d σ mu=d sigma\boldsymbol{\mu}=\boldsymbol{d} \boldsymbol{\sigma}μ=dσ in any given volume (such as the volume indicated by the dotted lines) is tailored to give precisely the same number as the net number of tubes of the 2 -form σ σ sigma\boldsymbol{\sigma}σ (diagram above) that emerge
from that volume (generalized Stokes theorem). For electromagnetism, the exterior derivative of Faraday or 2 -form F F F\boldsymbol{F}F gives a null 3 -form, but the exterior derivative of Maxwell or 2 -form *F gives 4 π 4 π 4pi4 \pi4π times the 3 -form J J **J* \boldsymbol{J}J of charge:
J = ρ d x d y d z J x d t d y d z J y d t d z d x J z d t d x d y J = ρ d x d y d z J x d t d y d z J y d t d z d x J z d t d x d y {:[**J=rho d,x^^dy^^dz-J_(x)dt^^dy^^dz],[,-J_(y)dt^^dz^^dx-J_(z)dt^^dx^^dy]:}\begin{array}{rl} * \boldsymbol{J}=\rho \boldsymbol{d} & x \wedge \boldsymbol{d} y \wedge \boldsymbol{d} z-J_{x} \boldsymbol{d} t \wedge \boldsymbol{d} y \wedge \boldsymbol{d} z \\ & -J_{y} \boldsymbol{d} t \wedge \boldsymbol{d} z \wedge \boldsymbol{d} x-J_{z} \boldsymbol{d} t \wedge \boldsymbol{d} x \wedge \boldsymbol{d} y \end{array}J=ρdxdydzJxdtdydzJydtdzdxJzdtdxdy
Box 4.4 (continued)
From 1-Form to 2-Form to 3-Form? No!
Starting with a 1 -form (electromagnetic 4-potential), construct its exterior derivative, the 2 -form F = d A F = d A F=dA\boldsymbol{F}=\boldsymbol{d A}F=dA (Faraday). The tubes in this honeycomb-like structure never end. So the number of tube endings in any elementary volume, and with it the 3 -form d F = d d A d F = d d A dF=ddA\boldsymbol{d} \boldsymbol{F}=\boldsymbol{d} \boldsymbol{d} \boldsymbol{A}dF=ddA, is automatically zero. This is another example of the general result that d d = 0 d d = 0 dd=0\boldsymbol{d} \boldsymbol{d}=0dd=0.

From 2-Form to 3-Form to 4-Form? No!

Starting with 2 -form * F F F\boldsymbol{F}F (Maxwell), construct its exterior derivative, the 3-form 4 π J 4 π J 4pi**J4 \pi * \boldsymbol{J}4πJ. The cells in this egg-crate type of structure extend in a fourth dimension ("hypertube"). The number of these hypertubes that end in any elementary 4 -volume, and with it the 4 -form
d ( 4 π J ) = d d F d 4 π J = d d F d(4pi^(**)J)=dd^(**)F\boldsymbol{d}\left(4 \pi^{*} \boldsymbol{J}\right)=\boldsymbol{d} \boldsymbol{d}^{*} \boldsymbol{F}d(4πJ)=ddF
is automatically zero, still another example of the general result that d d = 0 d d = 0 dd=0\boldsymbol{d} \boldsymbol{d}=0dd=0. This result says that
d J = ( ρ t + J x x + J y y + J z z ) d t d x d y d z = 0 d J = ρ t + J x x + J y y + J z z d t d x d y d z = 0 d^(**)J=((del rho)/(del t)+(delJ_(x))/(del x)+(delJ_(y))/(del y)+(delJ_(z))/(del z))dt^^dx^^dy^^dz=0\boldsymbol{d}^{*} \boldsymbol{J}=\left(\frac{\partial \rho}{\partial t}+\frac{\partial J_{x}}{\partial x}+\frac{\partial J_{y}}{\partial y}+\frac{\partial J_{z}}{\partial z}\right) \boldsymbol{d} t \wedge \boldsymbol{d} x \wedge \boldsymbol{d} y \wedge \boldsymbol{d} z=0dJ=(ρt+Jxx+Jyy+Jzz)dtdxdydz=0
("law of conservation of charge"). Note:
d x α d x β d x γ d x δ 4 ! d x [ α d x β d x γ d x δ ] d x α d x β d x γ d x δ 4 ! d x [ α d x β d x γ d x δ ] dx^(alpha)^^dx^(beta)^^dx^(gamma)^^dx^(delta)-=4!dx^([alpha)ox dx^(beta)ox dx^(gamma)ox dx^(delta])\boldsymbol{d} x^{\alpha} \wedge \boldsymbol{d} x^{\beta} \wedge \boldsymbol{d} x^{\gamma} \wedge \boldsymbol{d} x^{\delta} \equiv 4!\boldsymbol{d} x^{[\alpha} \otimes \boldsymbol{d} x^{\beta} \otimes \boldsymbol{d} x^{\gamma} \otimes \boldsymbol{d} x^{\delta]}dxαdxβdxγdxδ4!dx[αdxβdxγdxδ]
This implies d t d x d y d z = ε d t d x d y d z = ε dt^^dx^^dy^^dz=epsi\boldsymbol{d} t \wedge \boldsymbol{d} x \wedge \boldsymbol{d} y \wedge \boldsymbol{d} z=\boldsymbol{\varepsilon}dtdxdydz=ε.
From 3-Form to 4-Form τ = d ν = ν | α β γ | x δ d x δ d x α d x β d x γ τ = d ν = ν | α β γ | x δ d x δ d x α d x β d x γ tau=d nu=(delnu_(|alpha beta gamma|))/(delx^(delta))dx^(delta)^^dx^(alpha)^^dx^(beta)^^dx^(gamma)\boldsymbol{\tau}=\boldsymbol{d} \nu=\frac{\partial \nu_{|\alpha \beta \gamma|}}{\partial x^{\delta}} \boldsymbol{d} x^{\delta} \wedge \boldsymbol{d} x^{\alpha} \wedge \boldsymbol{d} x^{\beta} \wedge \boldsymbol{d} x^{\gamma}τ=dν=ν|αβγ|xδdxδdxαdxβdxγ
This four-dimensional "super-egg-crate" type structure is a machine to get a number (number of cells, r , n u v w r , n u v w (:r,n^^u^^v^^w:)\langle\boldsymbol{r}, \boldsymbol{n} \wedge \boldsymbol{u} \wedge \boldsymbol{v} \wedge \boldsymbol{w}\rangler,nuvw ) from a 4-volume n u v w n u v w n^^u^^v^^w\boldsymbol{n} \wedge \boldsymbol{u} \wedge \boldsymbol{v} \wedge \boldsymbol{w}nuvw.

From 4-Form to 5-Form? No!

Spacetime, being four-dimensional, cannot accommodate five-dimensional egg-crate structures. At least two of the d x μ d x μ dx^(mu)\boldsymbol{d} x^{\mu}dxμ 's in
d x α d x β d x γ d x δ d x ε d x α d x β d x γ d x δ d x ε dx^(alpha)^^dx^(beta)^^dx^(gamma)^^dx^(delta)^^dx^(epsi)\boldsymbol{d} x^{\alpha} \wedge \boldsymbol{d} x^{\beta} \wedge \boldsymbol{d} x^{\gamma} \wedge \boldsymbol{d} x^{\delta} \wedge \boldsymbol{d} x^{\varepsilon}dxαdxβdxγdxδdxε
must be the same; so, by antisymmetry of " ^^\wedge," this "basis 5 -form" must vanish.

Results of Exterior Differentiation, Summarized

0-form f f fff
1-form d f d f df\boldsymbol{d} fdf A A A\boldsymbol{A}A
2-form d d f 0 d d f 0 ddf-=0\boldsymbol{d} \boldsymbol{d} f \equiv 0ddf0 F = d A F = d A F=dA\boldsymbol{F}=\boldsymbol{d} \boldsymbol{A}F=dA F F ^(**)F{ }^{*} \boldsymbol{F}F
3-form d F = d d A 0 d F = d d A 0 dF=ddA-=0\boldsymbol{d} \boldsymbol{F}=\boldsymbol{d} \boldsymbol{d} \boldsymbol{A} \equiv 0dF=ddA0 4 π J = d F 4 π J = d F 4pi^(**)J=d^(**)F4 \pi^{*} \boldsymbol{J}=\boldsymbol{d}^{*} \boldsymbol{F}4πJ=dF ν ν nu\boldsymbol{\nu}ν
4-form d ( 4 π J ) = d d F 0 d 4 π J = d d F 0 d(4pi^(**)J)=dd^(**)F-=0\boldsymbol{d}\left(4 \pi^{*} \boldsymbol{J}\right)=\boldsymbol{d} \boldsymbol{d}^{*} \boldsymbol{F} \equiv 0d(4πJ)=ddF0 τ = d ν τ = d ν tau=d nu\boldsymbol{\tau}=\boldsymbol{d} \boldsymbol{\nu}τ=dν μ μ mu\boldsymbol{\mu}μ
5-form? No! d m 0 d m 0 dm-=0\boldsymbol{d} \boldsymbol{m} \equiv 0dm0 d μ 0 d μ 0 d mu-=0\boldsymbol{d} \mu \equiv 0dμ0
0-form f 1-form df A 2-form ddf-=0 F=dA ^(**)F 3-form dF=ddA-=0 4pi^(**)J=d^(**)F nu 4-form d(4pi^(**)J)=dd^(**)F-=0 tau=d nu mu 5-form? No! dm-=0 d mu-=0| 0-form | $f$ | | | | | | :--- | :---: | :---: | :---: | :---: | :---: | | 1-form | $\boldsymbol{d} f$ | $\boldsymbol{A}$ | | | | | 2-form | $\boldsymbol{d} \boldsymbol{d} f \equiv 0$ | $\boldsymbol{F}=\boldsymbol{d} \boldsymbol{A}$ | ${ }^{*} \boldsymbol{F}$ | | | | 3-form | | $\boldsymbol{d} \boldsymbol{F}=\boldsymbol{d} \boldsymbol{d} \boldsymbol{A} \equiv 0$ | $4 \pi^{*} \boldsymbol{J}=\boldsymbol{d}^{*} \boldsymbol{F}$ | $\boldsymbol{\nu}$ | | | 4-form | | | $\boldsymbol{d}\left(4 \pi^{*} \boldsymbol{J}\right)=\boldsymbol{d} \boldsymbol{d}^{*} \boldsymbol{F} \equiv 0$ | $\boldsymbol{\tau}=\boldsymbol{d} \boldsymbol{\nu}$ | $\boldsymbol{\mu}$ | | 5-form? | No! | | | $\boldsymbol{d} \boldsymbol{m} \equiv 0$ | $\boldsymbol{d} \mu \equiv 0$ |

New Forms from Old by Taking Dual (see exercise 3.14)

Dual of scalar f f fff is 4-form: f = f d x 0 d x 1 d x 2 d x 3 = f ε f = f d x 0 d x 1 d x 2 d x 3 = f ε ^(**)f=fdx^(0)^^dx^(1)^^dx^(2)^^dx^(3)=f epsi{ }^{*} f=f \boldsymbol{d} x^{0} \wedge \boldsymbol{d} x^{1} \wedge \boldsymbol{d} x^{2} \wedge \boldsymbol{d} x^{3}=f \varepsilonf=fdx0dx1dx2dx3=fε.
Dual of 1-form J J J\boldsymbol{J}J is 3-form: J = J 0 d x 1 d x 2 d x 3 J 1 d x 2 d x 3 d x 0 J = J 0 d x 1 d x 2 d x 3 J 1 d x 2 d x 3 d x 0 ^(**)J=J^(0)dx^(1)^^dx^(2)^^dx^(3)-J^(1)dx^(2)^^dx^(3)^^dx^(0){ }^{*} \boldsymbol{J}=J^{0} \boldsymbol{d} x^{1} \wedge \boldsymbol{d} x^{2} \wedge \boldsymbol{d} x^{3}-J^{1} \boldsymbol{d} x^{2} \wedge \boldsymbol{d} x^{3} \wedge \boldsymbol{d} x^{0}J=J0dx1dx2dx3J1dx2dx3dx0 + J 2 d x 3 d x 0 d x 1 J 3 d x 0 d x 1 d x 2 + J 2 d x 3 d x 0 d x 1 J 3 d x 0 d x 1 d x 2 +J^(2)dx^(3)^^dx^(0)^^dx^(1)-J^(3)dx^(0)^^dx^(1)^^dx^(2)+J^{2} \boldsymbol{d} x^{3} \wedge \boldsymbol{d} x^{0} \wedge \boldsymbol{d} x^{1}-J^{3} \boldsymbol{d} x^{0} \wedge \boldsymbol{d} x^{1} \wedge \boldsymbol{d} x^{2}+J2dx3dx0dx1J3dx0dx1dx2.
Dual of 2-form F F F\boldsymbol{F}F is 2-form: F = F | α β | ε α β | μ ν | d x μ d x p F = F | α β | ε α β | μ ν | d x μ d x p ^(**)F=F^(|alpha beta|)epsi_(alpha beta|mu nu|)dx^(mu)^^dx^(p){ }^{*} \boldsymbol{F}=F^{|\alpha \beta|} \varepsilon_{\alpha \beta|\mu \nu|} \boldsymbol{d} x^{\mu} \wedge \boldsymbol{d} x^{p}F=F|αβ|εαβ|μν|dxμdxp, where F α β = η α λ η β δ F λ δ F α β = η α λ η β δ F λ δ F^(alpha beta)=eta^(alpha lambda)eta^(beta delta)F_(lambda delta)F^{\alpha \beta}=\eta^{\alpha \lambda} \eta^{\beta \delta} F_{\lambda \delta}Fαβ=ηαληβδFλδ.
Dual of 3-form K K K\boldsymbol{K}K is 1-form: K = K 012 d x 3 K 123 d x 0 + K 230 d x 1 K 301 d x 2 K = K 012 d x 3 K 123 d x 0 + K 230 d x 1 K 301 d x 2 ^(**)K=K^(012)dx^(3)-K^(123)dx^(0)+K^(230)dx^(1)-K^(301)dx^(2){ }^{*} \boldsymbol{K}=K^{012} \boldsymbol{d} x^{3}-K^{123} \boldsymbol{d} x^{0}+K^{230} \boldsymbol{d} x^{1}-K^{301} \boldsymbol{d} x^{2}K=K012dx3K123dx0+K230dx1K301dx2, where K α β γ = η α μ η β ν η γ λ K μ ν λ K α β γ = η α μ η β ν η γ λ K μ ν λ K^(alpha beta gamma)=eta^(alpha mu)eta^(beta^(nu)eta^(gamma lambda))K_(mu nu lambda)K^{\alpha \beta \gamma}=\eta^{\alpha \mu} \eta^{\beta{ }^{\nu} \eta^{\gamma \lambda}} K_{\mu \nu \lambda}Kαβγ=ηαμηβνηγλKμνλ.
Dual of 4-form L L L\boldsymbol{L}L is a scalar: L = L 0123 d x 0 d x 1 d x 2 d x 3 L = L 0123 d x 0 d x 1 d x 2 d x 3 L=L_(0123)dx^(0)^^dx^(1)^^dx^(2)^^dx^(3)\boldsymbol{L}=L_{0123} \boldsymbol{d} x^{0} \wedge \boldsymbol{d} x^{1} \wedge \boldsymbol{d} x^{2} \wedge \boldsymbol{d} x^{3}L=L0123dx0dx1dx2dx3;
L = L 0123 = L 0123 . L = L 0123 = L 0123 . ^(**)L=L^(0123)=-L_(0123).{ }^{*} \boldsymbol{L}=L^{0123}=-L_{0123} .L=L0123=L0123.
Note 1: This concept of duality between one form and another is to be distinguished from the concept of duality between the vector basis e α e α e_(alpha)\boldsymbol{e}_{\alpha}eα and the 1-form basis ω α ω α omega^(alpha)\boldsymbol{\omega}^{\alpha}ωα of a given frame. The two types of duality have nothing whatsoever to do with each other!
Box 4.4 (continued)
Note 2: In spacetime, the operation of taking the dual, applied twice, leads back to the original form for forms of odd order, and to the negative thereof for forms of even order. In Euclidean 3-space the operation reproduces the original form, regardless of its order.

Duality Plus Exterior Differentiation

Start with scalar ϕ ϕ phi\phiϕ. Its gradient d ϕ d ϕ d phi\boldsymbol{d} \phidϕ is a 1 -form. Take its dual, to get the 3 -form * d ϕ d ϕ d phi\boldsymbol{d} \phidϕ. Take its exterior derivative, to get the 4 -form d d ϕ d d ϕ d^(**)d phi\boldsymbol{d}^{*} \boldsymbol{d} \phiddϕ. Take its dual, to get the scalar ϕ d d d ϕ d d d ◻phi-=-^(**)d**dd\square \phi \equiv-{ }^{*} \boldsymbol{d} \boldsymbol{*} \boldsymbol{d} \boldsymbol{d}ϕddd. Verify by index manipulations that \square as defined here is the wave operator; i.e., in any Lorentz frame, ϕ = ϕ , α , α = ( 2 ϕ / t 2 ) + 2 ϕ ϕ = ϕ , α , α = 2 ϕ / t 2 + 2 ϕ ◻phi=phi_(,alpha^(,alpha))=-(del^(2)phi//delt^(2))+grad^(2)phi\square \phi=\phi_{, \alpha^{, \alpha}}=-\left(\partial^{2} \phi / \partial t^{2}\right)+\nabla^{2} \phiϕ=ϕ,α,α=(2ϕ/t2)+2ϕ.
Start with 1 -form A A A\boldsymbol{A}A. Get 2 -form F = d A F = d A F=dA\boldsymbol{F}=\boldsymbol{d} \boldsymbol{A}F=dA. Take its dual F = d A F = d A ^(**)F=^(**)dA{ }^{\boldsymbol{*}} \boldsymbol{F}={ }^{\boldsymbol{*}} \boldsymbol{d} \boldsymbol{A}F=dA, also a 2 -form. Take its exterior derivative, obtaining the 3 -form d F d F d^(**)F\boldsymbol{d}^{*} \boldsymbol{F}dF (has value 4 π J 4 π J 4pi^(**)J4 \pi^{*} \boldsymbol{J}4πJ in electromagnetism). Take its dual, obtaining the 1 -form d F = d d d A = 4 π J d F = d d d A = 4 π J ^(**)d^(**)F=^(**)dd^(**)dA=4pi J{ }^{*} \boldsymbol{d}^{*} \boldsymbol{F}={ }^{*} \boldsymbol{d} \boldsymbol{\boldsymbol { d } ^ { * } \boldsymbol { d } \boldsymbol { A }}=4 \pi \boldsymbol{J}dF=dddA=4πJ ("Wave equation for electromagnetic 4-potential"). Reduce in index notation to
F μ ν , ν = A ν , μ , ν A μ , v , ν = 4 π J μ . F μ ν , ν = A ν , μ , ν A μ , v , ν = 4 π J μ . F_(mu nu)^(,nu)=A_(nu,mu)^(,nu)-A_(mu,v^(,nu))=4piJ_(mu).F_{\mu \nu}^{, \nu}=A_{\nu, \mu}{ }^{, \nu}-A_{\mu, v^{, \nu}}=4 \pi J_{\mu} .Fμν,ν=Aν,μ,νAμ,v,ν=4πJμ.
[More in Flanders (1963) or Misner and Wheeler (1957); see also exercise 3.17.]

§4.7. DISTANT ACTION FROM LOCAL LAW

Differential forms are a powerful tool in electromagnetic theory, but full power requires mastery of other tools as well. Action-at-a-distance techniques ("Green's functions," "propagators") are of special importance. Moreover, the passage from Maxwell field equations to electromagnetic action at a distance provides a preview of how Einstein's local equations will reproduce (approximately) Newton's 1 / r 2 1 / r 2 1//r^(2)1 / r^{2}1/r2 law.
In flat spacetime and in a Lorentz coordinate system, express the coordinates of particle A A AAA as a function of its proper time α α alpha\alphaα, thus:
(4.32) a μ = a μ ( α ) , d a μ d α = a ˙ μ ( α ) , d 2 a μ d α 2 = a ¨ μ ( α ) . (4.32) a μ = a μ ( α ) , d a μ d α = a ˙ μ ( α ) , d 2 a μ d α 2 = a ¨ μ ( α ) . {:(4.32)a^(mu)=a^(mu)(alpha)","quad(da^(mu))/(d alpha)=a^(˙)^(mu)(alpha)","quad(d^(2)a^(mu))/(dalpha^(2))=a^(¨)^(mu)(alpha).:}\begin{equation*} a^{\mu}=a^{\mu}(\alpha), \quad \frac{d a^{\mu}}{d \alpha}=\dot{a}^{\mu}(\alpha), \quad \frac{d^{2} a^{\mu}}{d \alpha^{2}}=\ddot{a}^{\mu}(\alpha) . \tag{4.32} \end{equation*}(4.32)aμ=aμ(α),daμdα=a˙μ(α),d2aμdα2=a¨μ(α).
Dirac found it helpful to express the distribution of charge and current for a particle of charge e e eee following such a motion as a superposition of charges that momentarily
flash into existence and then flash out of existence. Any such flash has a localization in space and time that can be written as the product of four Dirac delta functions [see, for example, Schwartz (1950-1951), Lighthill (1958)]:
(4.33) δ 4 ( x μ a μ ) = δ [ x 0 a 0 ( α ) ] δ [ x 1 a 1 ( α ) ] δ [ x 2 a 2 ( α ) ] δ [ x 3 a 3 ( α ) ] . (4.33) δ 4 x μ a μ = δ x 0 a 0 ( α ) δ x 1 a 1 ( α ) δ x 2 a 2 ( α ) δ x 3 a 3 ( α ) . {:(4.33)delta^(4)(x^(mu)-a^(mu))=delta[x^(0)-a^(0)(alpha)]delta[x^(1)-a^(1)(alpha)]delta[x^(2)-a^(2)(alpha)]delta[x^(3)-a^(3)(alpha)].:}\begin{equation*} \delta^{4}\left(x^{\mu}-a^{\mu}\right)=\delta\left[x^{0}-a^{0}(\alpha)\right] \delta\left[x^{1}-a^{1}(\alpha)\right] \delta\left[x^{2}-a^{2}(\alpha)\right] \delta\left[x^{3}-a^{3}(\alpha)\right] . \tag{4.33} \end{equation*}(4.33)δ4(xμaμ)=δ[x0a0(α)]δ[x1a1(α)]δ[x2a2(α)]δ[x3a3(α)].
Here any single Dirac function δ ( x ) δ ( x ) delta(x)\delta(x)δ(x) ("symbolic function"; "distribution"; "limit of a Gauss error function" as width is made indefinitely narrow and peak indefinitely high, with integrated value always unity) both (1) vanishes for x 0 x 0 x!=0x \neq 0x0, and (2) has the integral + δ ( x ) d x = 1 + δ ( x ) d x = 1 int_(-oo)^(+oo)delta(x)dx=1\int_{-\infty}^{+\infty} \delta(x) d x=1+δ(x)dx=1. Described in these terms, the density-current vector for the particle has the value ("superposition of flashes")
(4.34) J μ = e δ 4 [ x ν a ν ( α ) ] a ˙ μ ( α ) d α (4.34) J μ = e δ 4 x ν a ν ( α ) a ˙ μ ( α ) d α {:(4.34)J^(mu)=e intdelta^(4)[x^(nu)-a^(nu)(alpha)]a^(˙)^(mu)(alpha)d alpha:}\begin{equation*} J^{\mu}=e \int \delta^{4}\left[x^{\nu}-a^{\nu}(\alpha)\right] \dot{a}^{\mu}(\alpha) d \alpha \tag{4.34} \end{equation*}(4.34)Jμ=eδ4[xνaν(α)]a˙μ(α)dα
The density-current (4.34) drives the electromagnetic field, F F F\boldsymbol{F}F. Write F = d A F = d A F=dA\boldsymbol{F}=\boldsymbol{d} \boldsymbol{A}F=dA to satisfy automatically half of Maxwell's equations ( d F = d d A 0 ) ( d F = d d A 0 ) (dF=ddA-=0)(\boldsymbol{d} \boldsymbol{F}=\boldsymbol{d} \boldsymbol{d} \boldsymbol{A} \equiv 0)(dF=ddA0) :
(4.35) F μ α = A α x μ A μ x α . (4.35) F μ α = A α x μ A μ x α . {:(4.35)F_(mu alpha)=(delA_(alpha))/(delx^(mu))-(delA_(mu))/(delx^(alpha)).:}\begin{equation*} F_{\mu \alpha}=\frac{\partial A_{\alpha}}{\partial x^{\mu}}-\frac{\partial A_{\mu}}{\partial x^{\alpha}} . \tag{4.35} \end{equation*}(4.35)Fμα=AαxμAμxα.
In flat space, the remainder of Maxwell's equations ( d F = 4 π J ) d F = 4 π J (d^(**)F=4pi^(**)J)\left(\boldsymbol{d}^{*} \boldsymbol{F}=4 \pi^{*} \boldsymbol{J}\right)(dF=4πJ) become
F μ v x v = 4 π J μ F μ v x v = 4 π J μ (delF_(mu)^(v))/(delx^(v))=4piJ_(mu)\frac{\partial F_{\mu}{ }^{v}}{\partial x^{v}}=4 \pi J_{\mu}Fμvxv=4πJμ
or
(4.36) x μ A ν x ν η ν α 2 A μ x ν x α = 4 π J μ . (4.36) x μ A ν x ν η ν α 2 A μ x ν x α = 4 π J μ . {:(4.36)(del)/(delx^(mu))(delA^(nu))/(delx^(nu))-eta^(nu alpha)(del^(2)A_(mu))/(delx^(nu)delx^(alpha))=4piJ_(mu).:}\begin{equation*} \frac{\partial}{\partial x^{\mu}} \frac{\partial A^{\nu}}{\partial x^{\nu}}-\eta^{\nu \alpha} \frac{\partial^{2} A_{\mu}}{\partial x^{\nu} \partial x^{\alpha}}=4 \pi J_{\mu} . \tag{4.36} \end{equation*}(4.36)xμAνxνηνα2Aμxνxα=4πJμ.
Make use of the freedom that exists in the choice of 4-potentials A ν A ν A^(nu)A^{\nu}Aν to demand
(4.37) A v x y = 0 (4.37) A v x y = 0 {:(4.37)(delA^(v))/(delx^(y))=0:}\begin{equation*} \frac{\partial A^{v}}{\partial x^{y}}=0 \tag{4.37} \end{equation*}(4.37)Avxy=0
(Lorentz gauge condition; see exercise 3.17). Thus get
(4.38) 1 A μ = 4 π J μ (4.38) 1 A μ = 4 π J μ {:(4.38)1A_(mu)=-4piJ_(mu):}\begin{equation*} 1 A_{\mu}=-4 \pi J_{\mu} \tag{4.38} \end{equation*}(4.38)1Aμ=4πJμ
The density-current being the superposition of "flashes," the effect ( A ) ( A ) (A)(\boldsymbol{A})(A) of this density-current can be expressed as the superposition of the effects E E EEE of elementary flashes; thus
(4.39) A μ ( x ) = E [ x a ( α ) ] a ˙ μ ( α ) d α (4.39) A μ ( x ) = E [ x a ( α ) ] a ˙ μ ( α ) d α {:(4.39)A^(mu)(x)=int E[x-a(alpha)]a^(˙)^(mu)(alpha)d alpha:}\begin{equation*} A^{\mu}(x)=\int E[x-a(\alpha)] \dot{a}^{\mu}(\alpha) d \alpha \tag{4.39} \end{equation*}(4.39)Aμ(x)=E[xa(α)]a˙μ(α)dα
where the "elementary effect" E E EEE ("kernel"; "Green's function") satisfies the equation
(4.40) E ( x ) = 4 π δ 4 ( x ) . (4.40) E ( x ) = 4 π δ 4 ( x ) . {:(4.40)◻E(x)=-4pidelta^(4)(x).:}\begin{equation*} \square E(x)=-4 \pi \delta^{4}(x) . \tag{4.40} \end{equation*}(4.40)E(x)=4πδ4(x).
One solution is the "half-advanced-plus-half-retarded potential,"
(4.41) E ( x ) = δ ( η α β x α x β ) . (4.41) E ( x ) = δ η α β x α x β . {:(4.41)E(x)=delta(eta_(alpha beta)x^(alpha)x^(beta)).:}\begin{equation*} E(x)=\delta\left(\eta_{\alpha \beta} x^{\alpha} x^{\beta}\right) . \tag{4.41} \end{equation*}(4.41)E(x)=δ(ηαβxαxβ).
World line of charge regarded as succession of flash-on, flash-off charges
It vanishes everywhere except on the backward and forward light cones, where it has equal strength. Normally more useful is the retarded solution,
(4.42) R ( x ) = { 2 E ( x ) if x 0 > 0 , 0 if x 0 < 0 (4.42) R ( x ) = 2 E ( x )  if  x 0 > 0 , 0  if  x 0 < 0 {:(4.42)R(x)={[2E(x)," if "x^(0) > 0","],[0," if "x^(0) < 0]:}:}R(x)= \begin{cases}2 E(x) & \text { if } x^{0}>0, \tag{4.42}\\ 0 & \text { if } x^{0}<0\end{cases}(4.42)R(x)={2E(x) if x0>0,0 if x0<0
which is obtained by doubling (4.41) in the region of the forward light cone and nullifying it in the region of the backward light cone. All electrodynamics (Coulomb forces, Ampère's law, electromagnetic induction, radiation) follows from the simple expression (4.39) for the vector potential [see, e.g., Wheeler and Feynman (1945) and (1949), also Rohrlich (1965)].

EXERCISES

Exercise 4.1. GENERIC LOCAL ELECTROMAGNETIC FIELD EXPRESSED IN SIMPLEST FORM

In the laboratory Lorentz frame, the electric field is E E E\boldsymbol{E}E, the magnetic field B B B\boldsymbol{B}B. Special cases are: (1) pure electric field ( B = 0 ) ( B = 0 ) (B=0)(\boldsymbol{B}=0)(B=0); (2) pure magnetic field ( E = 0 E = 0 E=0\boldsymbol{E}=0E=0 ); and (3) "radiation field" or "null field" ( E E E\boldsymbol{E}E and B B B\boldsymbol{B}B equal in magnitude and perpendicular in direction). All cases other than (1), (2), and (3) are "generic." In the generic case, calculate the Poynting density of flow of energy E × B / 4 π E × B / 4 π E xx B//4pi\boldsymbol{E} \times \boldsymbol{B} / 4 \piE×B/4π and the density of energy ( E 2 + B 2 ) / 8 π E 2 + B 2 / 8 π (E^(2)+B^(2))//8pi\left(\boldsymbol{E}^{2}+\boldsymbol{B}^{2}\right) / 8 \pi(E2+B2)/8π. Define the direction of a unit vector n n n\boldsymbol{n}n and the magnitude of a velocity parameter α α alpha\alphaα by the ratio of energy flow to energy density:
n tanh 2 α = 2 E × B E 2 + B 2 . n tanh 2 α = 2 E × B E 2 + B 2 . n tanh 2alpha=(2E xx B)/(E^(2)+B^(2)).\boldsymbol{n} \tanh 2 \alpha=\frac{2 \boldsymbol{E} \times \boldsymbol{B}}{\boldsymbol{E}^{2}+\boldsymbol{B}^{2}} .ntanh2α=2E×BE2+B2.
View the same electromagnetic field in a rocket frame moving in the direction of n n n\boldsymbol{n}n with the velocity parameter α α alpha\alphaα (not 2 α 2 α 2alpha2 \alpha2α; factor 2 comes in because energy flow and energy density are components, not of a vector, but of a tensor). By employing the formulas for a Lorentz transformation (equation 3.23), or otherwise, show that the energy flux vanishes in the rocket frame, with the consequence that E E ¯ bar(E)\overline{\boldsymbol{E}}E and B B ¯ bar(B)\overline{\boldsymbol{B}}B are parallel. No one can prevent the z ¯ z ¯ bar(z)\bar{z}z¯-axis from being put in the direction common to E E ¯ bar(E)\overline{\boldsymbol{E}}E and B B ¯ bar(B)\overline{\boldsymbol{B}}B. Show that with this choice of direction, Faraday becomes
F = E ¯ z d z ¯ d t ¯ + B ¯ z d x ¯ d y ¯ F = E ¯ z d z ¯ d t ¯ + B ¯ z d x ¯ d y ¯ F= bar(E)_(z)d bar(z)^^d bar(t)+ bar(B)_(z)d bar(x)^^d bar(y)\boldsymbol{F}=\bar{E}_{z} \boldsymbol{d} \bar{z} \wedge \boldsymbol{d} \bar{t}+\bar{B}_{z} \boldsymbol{d} \bar{x} \wedge \boldsymbol{d} \bar{y}F=E¯zdz¯dt¯+B¯zdx¯dy¯
(only two wedge products needed to represent the generic local field; "canonical representation"; valid in one frame, valid in any frame).

Exercise 4.2. FREEDOM OF CHOICE OF 1-FORMS IN CANONICAL representation of generic local field

Deal with a region so small that the variation of the field from place to place can be neglected. Write Faraday in canonical representation in the form
F = d p I d q I + d p I I d q I I F = d p I d q I + d p I I d q I I F=dp_(I)^^dq^(I)+dp_(II)^^dq^(II)\boldsymbol{F}=\boldsymbol{d} p_{I} \wedge \boldsymbol{d} q^{I}+\boldsymbol{d} p_{I I} \wedge \boldsymbol{d} q^{I I}F=dpIdqI+dpIIdqII
where p A ( A = I p A ( A = I p_(A)(A=Ip_{A}(A=IpA(A=I or I I ) I I ) II)I I)II) and q A q A q^(A)q^{A}qA are scalar functions of position in spacetime. Define a "canonical transformation" to new scalar functions of position p A ¯ p A ¯ p_( bar(A))p_{\bar{A}}pA¯ and q A ¯ q A ¯ q^( bar(A))q^{\bar{A}}qA¯ by way of the "equation of transformation"
p A d q A = d S + p A ¯ d q A ¯ , p A d q A = d S + p A ¯ d q A ¯ , p_(A)dq^(A)=dS+p_( bar(A))dq^( bar(A)),p_{A} \boldsymbol{d} q^{A}=\boldsymbol{d} S+p_{\bar{A}} \boldsymbol{d} q^{\bar{A}},pAdqA=dS+pA¯dqA¯,
Figure 4.7.
Some simple types of 1 -forms compared and contrasted.
where the "generating function" S S SSS of the transformation is an arbitrary function of the q A q A q^(A)q^{A}qA and the q A ¯ q A ¯ q^( bar(A))q^{\bar{A}}qA¯ :
d S = ( S / q A ) d q A + ( S / q A ¯ ) d q A ¯ d S = S / q A d q A + S / q A ¯ d q A ¯ dS=(del S//delq^(A))dq^(A)+(del S//delq^( bar(A)))dq^( bar(A))\boldsymbol{d} S=\left(\partial S / \partial q^{A}\right) \boldsymbol{d} q^{A}+\left(\partial S / \partial q^{\bar{A}}\right) \boldsymbol{d} q^{\bar{A}}dS=(S/qA)dqA+(S/qA¯)dqA¯
(a) Derive expressions for the two p A p A p_(A)p_{A}pA 's and the two p A ¯ p A ¯ p_( bar(A))p_{\bar{A}}pA¯ 's in terms of S S SSS by equating coefficients of d q I , d q I I , d q I , d q I I d q I , d q I I , d q I , d q I I dq^(I),dq^(II),dq^(I),dq^(II)d q^{I}, d q^{I I}, d q^{I}, d q^{I I}dqI,dqII,dqI,dqII individually on the two sides of the equation of transformation.
(b) Use these expressions for the p A p A p_(A)p_{A}pA 's and p A ¯ p A ¯ p_( bar(A))p_{\bar{A}}pA¯ 's to show that F = d p A d q A F = d p A d q A F=dp_(A)^^dq^(A)\boldsymbol{F}=\boldsymbol{d} p_{A} \wedge \boldsymbol{d} q^{A}F=dpAdqA and F = F ¯ = bar(F)=\overline{\boldsymbol{F}}=F= d p A ¯ d q A ¯ d p A ¯ d q A ¯ dp_( bar(A))^^dq^( bar(A))\boldsymbol{d} p_{\bar{A}} \wedge \boldsymbol{d} q^{\bar{A}}dpA¯dqA¯, ostensibly different, are actually expressions for one and the same 2 -form in terms of alternative sets of 1 -forms.

Exercise 4.3. A CLOSED OR CURL-FREE 1-FORM IS A GRADIENT

Given a 1 -form σ σ sigma\boldsymbol{\sigma}σ such that d σ = 0 d σ = 0 d sigma=0\boldsymbol{d} \boldsymbol{\sigma}=0dσ=0, show that σ σ sigma\boldsymbol{\sigma}σ can be expressed in the form σ = d f σ = d f sigma=df\boldsymbol{\sigma}=\boldsymbol{d} fσ=df, where f f fff is some scalar. The 1 -form σ σ sigma\boldsymbol{\sigma}σ is said to be "curl-free," a narrower category of 1 -form than the "rotation-free" 1-form of the next exercise (expressible as σ = h d f σ = h d f sigma=hdf\boldsymbol{\sigma}=h \boldsymbol{d} fσ=hdf ), and it in turn is narrower (see Figure 4.7) than the category of " 1 -forms with rotation" (not expressible in the form σ = h d f σ = h d f sigma=hdf\boldsymbol{\sigma}=h \boldsymbol{d} fσ=hdf ). When the 1 -form σ σ sigma\boldsymbol{\sigma}σ is expressed in terms of basis 1-forms d x α d x α dx^(alpha)\boldsymbol{d} x^{\alpha}dxα, multiplied by corresponding components σ α σ α sigma_(alpha)\sigma_{\alpha}σα, show that "curl-free" implies σ [ α , β ] = 0 σ [ α , β ] = 0 sigma_([alpha,beta])=0\sigma_{[\alpha, \beta]}=0σ[α,β]=0.

Exercise 4.4. CANONICAL EXPRESSION FOR A ROTATION-FREE 1-FORM

In three dimensions a rigid body turning with angular velocity ω ω omega\omegaω about the z z zzz-axis has components of velocity v y = ω x v y = ω x v_(y)=omega xv_{y}=\omega xvy=ωx, and v x = ω y v x = ω y v_(x)=-omega yv_{x}=-\omega yvx=ωy. The quantity curl v = × v v = × v v=grad xx vv=\nabla \times vv=×v has z z zzz-component equal to 2 ω 2 ω 2omega2 \omega2ω, and all other components equal to zero. Thus the scalar product of v v vvv and curl v v v\boldsymbol{v}v vanishes:
v [ i , j v k ] = 0 v [ i , j v k ] = 0 v_([i,j)v_(k])=0v_{[i, j} v_{k]}=0v[i,jvk]=0
The same concept generalizes to four dimensions,
v [ α , β v γ ] = 0 v [ α , β v γ ] = 0 v_([alpha,beta)v_(gamma])=0v_{[\alpha, \beta} v_{\gamma]}=0v[α,βvγ]=0
and lends itself to expression in coordinate-free language, as the requirement that a certain 3 -form must vanish:
d v v = 0 d v v = 0 dv^^v=0\boldsymbol{d} \boldsymbol{v} \wedge \boldsymbol{v}=0dvv=0
Any 1-form v v v\boldsymbol{v}v satisfying this condition is said to be "rotation-free." Show that a 1 -form is rotation-free if and only if it can be written in the form
v = h d f v = h d f v=hdf\boldsymbol{v}=h \boldsymbol{d} fv=hdf
where h h hhh and f f fff are scalar functions of position (the "Frobenius theorem").

Exercise 4.5. FORMS ENDOWED WITH POLAR SINGULARITIES

List the principal results on how such forms are representable, such as
Φ 1 = d S S ψ 1 + θ 1 Φ 1 = d S S ψ 1 + θ 1 Phi_(1)=(dS)/(S)^^psi_(1)+theta_(1)\boldsymbol{\Phi}_{1}=\frac{\boldsymbol{d} S}{S} \wedge \boldsymbol{\psi}_{1}+\boldsymbol{\theta}_{1}Φ1=dSSψ1+θ1
and the conditions under which each applies [for the meaning and answer to this exercise, see Lascoux (1968)].

Exercise 4.6. THE FIELD OF THE OSCILLATING DIPOLE

Verify that the expressions given for the electromagnetic field of an oscillating dipole in equations (4.23) and (4.24) satisfy d F = 0 d F = 0 dF=0\boldsymbol{d} \boldsymbol{F}=0dF=0 everywhere and d F = 0 d F = 0 d^(**)F=0\boldsymbol{d}^{*} \boldsymbol{F}=0dF=0 everywhere except at the origin.

Exercise 4.7. THE 2-FORM MACHINERY TRANSLATED INTO TENSOR MACHINERY

This exercise is stated at the end of the legend caption of Figure 4.1.

Exercise 4.8. PANCAKING THE COULOMB FIELD

Figure 4.5 shows a spacelike slice, t = t = t=t=t= const, through the Maxwell of a point-charge at rest. By the following pictorial steps, verify that the electric-field lines get compressed into the transverse direction when viewed from a moving Lorentz frame: (1) Draw a picture of an equatorial slice ( θ = π / 2 ; t , r , ϕ θ = π / 2 ; t , r , ϕ theta=pi//2;t,r,phi\theta=\pi / 2 ; t, r, \phiθ=π/2;t,r,ϕ variable) through Maxwell = F = F =^(**)F={ }^{*} \boldsymbol{F}=F. (2) Draw various spacelike slices, corresponding to constant time in various Lorentz frames, through the resultant geometric structure. (3) Interpret the intersection of Maxwell = F = F =**F=\boldsymbol{*} \boldsymbol{F}=F with each Lorentz slice in the manner of Figure 4.3.

Exercise 4.9. COMPUTATION OF SURFACE INTEGRALS

In Box 4.1 the definition
α = α , P λ 1 P λ p d λ 1 d λ p α = α , P λ 1 P λ p d λ 1 d λ p int alpha=int dots int(:alpha,(delP)/(dellambda^(1))^^dots^^(delP)/(dellambda^(p)):)dlambda^(1)dots dlambda^(p)\int \boldsymbol{\alpha}=\int \ldots \int\left\langle\boldsymbol{\alpha}, \frac{\partial \mathscr{P}}{\partial \lambda^{1}} \wedge \ldots \wedge \frac{\partial \mathscr{P}}{\partial \lambda^{p}}\right\rangle d \lambda^{1} \ldots d \lambda^{p}α=α,Pλ1Pλpdλ1dλp
is given for the integral of a p p ppp-form α α alpha\boldsymbol{\alpha}α over a p p ppp-surface P ( λ 1 , , λ p ) P λ 1 , , λ p P(lambda^(1),dots,lambda^(p))\mathscr{P}\left(\lambda^{1}, \ldots, \lambda^{p}\right)P(λ1,,λp) in n n nnn-dimensional space. From this show that the following computational rule (also given in Box 4.1) works: (1) substitute the equation for the surface,
x k = x k ( λ 1 , , λ p ) x k = x k λ 1 , , λ p x^(k)=x^(k)(lambda^(1),dots,lambda^(p))x^{k}=x^{k}\left(\lambda^{1}, \ldots, \lambda^{p}\right)xk=xk(λ1,,λp)
into α α alpha\boldsymbol{\alpha}α and collect terms in the form
α = a ( λ 1 , , λ p ) d λ 1 d λ p α = a λ 1 , , λ p d λ 1 d λ p alpha=a(lambda^(1),dots,lambda^(p))dlambda^(1)^^cdots^^dlambda^(p)\boldsymbol{\alpha}=a\left(\lambda^{1}, \ldots, \lambda^{p}\right) \boldsymbol{d} \lambda^{1} \wedge \cdots \wedge \boldsymbol{d} \lambda^{p}α=a(λ1,,λp)dλ1dλp
(2) integrate
α = a ( λ 1 , , λ p ) d λ 1 d λ p α = a λ 1 , , λ p d λ 1 d λ p int alpha=int dots int a(lambda^(1),dots,lambda^(p))dlambda^(1)dots dlambda^(p)\int \alpha=\int \ldots \int a\left(\lambda^{1}, \ldots, \lambda^{p}\right) d \lambda^{1} \ldots d \lambda^{p}α=a(λ1,,λp)dλ1dλp
using the elementary definition of integration.

Exercise 4.10. WHITAKER'S CALUMOID, OR, THE LIFE OF A LOOP

Take a closed loop, bounding a 2 -dimensional surface S S SSS. It entraps a certain flux of Faraday Φ F = S F Φ F = S F Phi_(F)=int_(S)F\boldsymbol{\Phi}_{F}=\int_{S} \boldsymbol{F}ΦF=SF ("magnetic tubes") and a certain flux of Maxwell Φ M = S F Φ M = S F Phi_(M)=int_(S)**F\Phi_{M}=\int_{S} * \boldsymbol{F}ΦM=SF ("electric tubes").
(a) Show that the fluxes Φ F Φ F Phi_(F)\Phi_{F}ΦF and Φ M Φ M Phi_(M)\Phi_{M}ΦM depend only on the choice of loop, and not on the choice of the surface S S SSS bounded by the loop, if and only if d F = d F = 0 d F = d F = 0 dF=d^(**)F=0\boldsymbol{d} \boldsymbol{F}=\boldsymbol{d}^{*} \boldsymbol{F}=0dF=dF=0 (no magnetic charge; no electric charge). Hint: use generalized Stokes theorem, Boxes 4.1 and 4.6.
(b) Move the loop in space and time so that it continues to entrap the same two fluxes. Move it forward a little more here, a little less there, so that it continues to do so. In this way trace out a 2-dimensional surface ("calumoid"; see E. T. Whitaker 1904) P = P ( a , b ) P = P ( a , b ) P=P(a,b)\mathscr{P}=\mathscr{P}(a, b)P=P(a,b); x μ = x μ ( a , b ) x μ = x μ ( a , b ) x^(mu)=x^(mu)(a,b)x^{\mu}=x^{\mu}(a, b)xμ=xμ(a,b). Show that the elementary bivector in this surface, Σ = P / a P / b Σ = P / a P / b Sigma=delP//del a^^delP//del b\boldsymbol{\Sigma}=\partial \mathscr{P} / \partial a \wedge \partial \mathscr{P} / \partial bΣ=P/aP/b satisfies F , Σ = 0 F , Σ = 0 (:F,Sigma:)=0\langle\boldsymbol{F}, \boldsymbol{\Sigma}\rangle=0F,Σ=0 and F , Σ = 0 F , Σ = 0 (:**F,Sigma:)=0\langle\boldsymbol{*} \boldsymbol{F}, \boldsymbol{\Sigma}\rangle=0F,Σ=0.
(c) Show that these differential equations for x μ ( a , b ) x μ ( a , b ) x^(mu)(a,b)x^{\mu}(a, b)xμ(a,b) can possess a solution, with given initial condition x μ = x μ ( a , 0 ) x μ = x μ ( a , 0 ) x^(mu)=x^(mu)(a,0)x^{\mu}=x^{\mu}(a, 0)xμ=xμ(a,0) for the initial location of the loop, if d F = 0 d F = 0 dF=0\boldsymbol{d} \boldsymbol{F}=0dF=0 and d F = 0 d F = 0 d^(**)F=0\boldsymbol{d}^{*} \boldsymbol{F}=0dF=0 (no magnetic charge, no electric charge).
(d) Consider a static, uniform electric field F = E x d t d x F = E x d t d x F=-E_(x)dt^^dx\boldsymbol{F}=-E_{x} \boldsymbol{d} t \wedge \boldsymbol{d} xF=Exdtdx. Solve the equations, F , Σ = 0 F , Σ = 0 (:F,Sigma:)=0\langle\boldsymbol{F}, \boldsymbol{\Sigma}\rangle=0F,Σ=0 and F , Σ = 0 F , Σ = 0 (:^(**)F,Sigma:)=0\left\langle{ }^{*} \boldsymbol{F}, \boldsymbol{\Sigma}\right\rangle=0F,Σ=0 to find the equation P ( a , b ) P ( a , b ) P(a,b)\mathscr{P}(a, b)P(a,b) for the most general calumoid. [Answer: y = y ( a ) , z = z ( a ) , x = x ( b ) , t = t ( b ) y = y ( a ) , z = z ( a ) , x = x ( b ) , t = t ( b ) y=y(a),z=z(a),x=x(b),t=t(b)y=y(a), z=z(a), x=x(b), t=t(b)y=y(a),z=z(a),x=x(b),t=t(b).] Exhibit two special cases: (i) a calumoid that lies entirely in a hypersurface of constant time [loop moves at infinite velocity; analogous to super-light velocity of point of crossing for two blades of a pair of scissors]; (ii) a calumoid whose loop remains forever at rest in the t , x , y , z t , x , y , z t,x,y,zt, x, y, zt,x,y,z Lorentz frame.

Exercise 4.11. DIFFERENTIAL FORMS AND HAMILTONIAN MECHANICS

Consider a dynamic system endowed with two degrees of freedom. For the definition of this system as a Hamiltonian system (special case: here the Hamiltonian is independent of time), one needs (1) a definition of canonical variables (see Box 4.5) and (2) a knowledge of the Hamiltonian H H HHH as a function of the coordinates q 1 , q 2 q 1 , q 2 q^(1),q^(2)q^{1}, q^{2}q1,q2 and the canonically conjugate momenta p 1 , p 2 p 1 , p 2 p_(1),p_(2)p_{1}, p_{2}p1,p2. To derive the laws of mechanics, consider the five-dimensional space of p 1 , p 2 , q 1 , q 2 p 1 , p 2 , q 1 , q 2 p_(1),p_(2),q^(1),q^(2)p_{1}, p_{2}, q^{1}, q^{2}p1,p2,q1,q2, and t t ttt, and a curve in this space leading from starting values of the five coordinates (subscript A A AAA ) to final values (subscript B B BBB ), and the value
I = A B p 1 d q 1 + p 2 d q 2 H ( p , q ) d t = A B ω I = A B p 1 d q 1 + p 2 d q 2 H ( p , q ) d t = A B ω I=int_(A)^(B)p_(1)dq^(1)+p_(2)dq^(2)-H(p,q)dt=int_(A)^(B)omegaI=\int_{A}^{B} p_{1} \boldsymbol{d} q^{1}+p_{2} \boldsymbol{d} q^{2}-H(p, q) \boldsymbol{d} t=\int_{A}^{B} \boldsymbol{\omega}I=ABp1dq1+p2dq2H(p,q)dt=ABω
of the integral I I III taken along this path. The difference of the integral for two "neighboring" paths enclosing a two-dimensional region S S SSS, according to the theorem of Stokes (Boxes 4.1 and 4.6), is
δ I = f r ω = s d ω δ I = f r ω = s d ω delta I=f_(r)omega=int_(s)d omega\delta I=f_{r} \omega=\int_{s} d \omegaδI=frω=sdω
The principle of least action (principle of "extremal history") states that the representative point of the system must travel along a route in the five-dimensional manifold (route with tangent vector d P / d t ) d P / d t ) dP//dt)d \mathscr{P} / d t)dP/dt) such that the variation vanishes for this path; i.e.,
d ω ( , d P / d t ) = 0 d ω ( , d P / d t ) = 0 d omega(dots,dP//dt)=0\boldsymbol{d} \boldsymbol{\omega}(\ldots, d \mathscr{P} / d t)=0dω(,dP/dt)=0
(2-form d ω d ω d omega\boldsymbol{d} \omegadω with a single vector argument supplied, and other slot left unfilled, gives the 1 -form in 5-space that must vanish). This fixes only the direction of d P / d t d P / d t dP//dtd \mathscr{P} / d tdP/dt; its magnitude can be normalized by requiring d t , d P / d t = 1 d t , d P / d t = 1 (:dt,dP//dt:)=1\langle\boldsymbol{d} t, d \mathscr{P} / d t\rangle=1dt,dP/dt=1.
(a) Evaluate d ω d ω d omega\boldsymbol{d} \boldsymbol{\omega}dω from the expression ω = p j d q j H d t ω = p j d q j H d t omega=p_(j)dq^(j)-Hdt\boldsymbol{\omega}=p_{j} \boldsymbol{d} q^{j}-H \boldsymbol{d} tω=pjdqjHdt.
(b) Set d P / d t = q ˙ j ( P / q j ) + p ˙ j ( P / p j ) + i ˙ ( P / t ) d P / d t = q ˙ j P / q j + p ˙ j P / p j + i ˙ ( P / t ) dP//dt=q^(˙)^(j)(delP//delq^(j))+p^(˙)_(j)(delP//delp_(j))+i^(˙)(delP//del t)d \mathscr{P} / d t=\dot{q}^{j}\left(\partial \mathscr{P} / \partial q^{j}\right)+\dot{p}_{j}\left(\partial \mathscr{P} / \partial p_{j}\right)+\dot{i}(\partial \mathscr{P} / \partial t)dP/dt=q˙j(P/qj)+p˙j(P/pj)+i˙(P/t), and expand d ω ( , d P / d t ) = 0 d ω ( , d P / d t ) = 0 d omega(dots,dP//dt)=0\boldsymbol{d} \omega(\ldots, d \mathscr{P} / d t)=0dω(,dP/dt)=0 in terms of the basis { d p j , d q k , d t } d p j , d q k , d t {dp_(j),dq^(k),dt}\left\{\boldsymbol{d} p_{j}, \boldsymbol{d} q^{k}, \boldsymbol{d} t\right\}{dpj,dqk,dt}.
Box 4.5 METRIC STRUCTURE AND HAMILTONIAN OR "SYMPLECTIC STRUCTURE" COMPARED AND CONTRASTED
Metric structure Symplectic structure
1. Physical application Geometry of spacetime Hamiltonian mechanics
2. Canonical structure ( ) = " d s 2 " = d t d t + d x d x + d y d y + d z d z ( ) = " d s 2 " = d t d t + d x d x + d y d y + d z d z {:[(dots*)="ds^(2)"=-dt ox dt],[quad+dx ox dx+dy ox dy],[quad+dz ox dz]:}\begin{aligned} & (\ldots \cdot)=" \boldsymbol{d} \boldsymbol{s}^{2} "=-\boldsymbol{d} t \otimes \boldsymbol{d} t \\ & \quad+\boldsymbol{d} x \otimes \boldsymbol{d} x+\boldsymbol{d} y \otimes \boldsymbol{d} y \\ & \quad+\boldsymbol{d} z \otimes \boldsymbol{d} z \end{aligned}()="ds2"=dtdt+dxdx+dydy+dzdz O = d p 1 d q 1 + d p 2 d q 2 O = d p 1 d q 1 + d p 2 d q 2 O=dp_(1)^^dq^(1)+dp_(2)^^dq^(2)\mathcal{O}=\boldsymbol{d} p_{1} \wedge \boldsymbol{d} q^{1}+\boldsymbol{d} p_{2} \wedge \boldsymbol{d} q^{2}O=dp1dq1+dp2dq2
3. Nature of "metric" Symmetric Antisymmetric
4. Name for given coordinate system and any other set of four coordinates in which metric has same form Lorentz coordinate system System of "canonically" (or "dynamically") conjugate coordinates
5. Field equation for this metric R μ ν α β = 0 R μ ν α β = 0 R_(mu nu alpha beta)=0\boldsymbol{R}_{\mu \nu \alpha \beta}=0Rμναβ=0 (zero Riemann curvature; flat spacetime) d O = 0 ("closed 2-form"; condition automatically satisfied by expression above). d O = 0  ("closed 2-form";   condition automatically   satisfied by expression above).  {:[dO=0" ("closed 2-form"; "],[" condition automatically "],[" satisfied by expression above). "]:}\begin{aligned} & \boldsymbol{d} \boldsymbol{O}=0 \text { ("closed 2-form"; } \\ & \text { condition automatically } \\ & \text { satisfied by expression above). } \end{aligned}dO=0 ("closed 2-form";  condition automatically  satisfied by expression above). 
6. The four-dimensional manifold Spacetime Phase space
7. Coordinate-free description of the structure of this manifold R i e m a n n = 0 R i e m a n n = 0 Riemann=0\boldsymbol{R i e m a n n}=0Riemann=0 d O = 0 d O = 0 dO=0\boldsymbol{d} \mathcal{O}=0dO=0
8. Canonical coordinates distinguished from other Make metric take above form (item 2) Make metric take above form (item 2)
Metric structure Symplectic structure 1. Physical application Geometry of spacetime Hamiltonian mechanics 2. Canonical structure "(dots*)=ds^(2)=-dt ox dt quad+dx ox dx+dy ox dy quad+dz ox dz" O=dp_(1)^^dq^(1)+dp_(2)^^dq^(2) 3. Nature of "metric" Symmetric Antisymmetric 4. Name for given coordinate system and any other set of four coordinates in which metric has same form Lorentz coordinate system System of "canonically" (or "dynamically") conjugate coordinates 5. Field equation for this metric R_(mu nu alpha beta)=0 (zero Riemann curvature; flat spacetime) "dO=0 (closed 2-form; condition automatically satisfied by expression above). " 6. The four-dimensional manifold Spacetime Phase space 7. Coordinate-free description of the structure of this manifold Riemann=0 dO=0 8. Canonical coordinates distinguished from other Make metric take above form (item 2) Make metric take above form (item 2)| | Metric structure | Symplectic structure | | :---: | :---: | :---: | | 1. Physical application | Geometry of spacetime | Hamiltonian mechanics | | 2. Canonical structure | $\begin{aligned} & (\ldots \cdot)=" \boldsymbol{d} \boldsymbol{s}^{2} "=-\boldsymbol{d} t \otimes \boldsymbol{d} t \\ & \quad+\boldsymbol{d} x \otimes \boldsymbol{d} x+\boldsymbol{d} y \otimes \boldsymbol{d} y \\ & \quad+\boldsymbol{d} z \otimes \boldsymbol{d} z \end{aligned}$ | $\mathcal{O}=\boldsymbol{d} p_{1} \wedge \boldsymbol{d} q^{1}+\boldsymbol{d} p_{2} \wedge \boldsymbol{d} q^{2}$ | | 3. Nature of "metric" | Symmetric | Antisymmetric | | 4. Name for given coordinate system and any other set of four coordinates in which metric has same form | Lorentz coordinate system | System of "canonically" (or "dynamically") conjugate coordinates | | 5. Field equation for this metric | $\boldsymbol{R}_{\mu \nu \alpha \beta}=0$ (zero Riemann curvature; flat spacetime) | $\begin{aligned} & \boldsymbol{d} \boldsymbol{O}=0 \text { ("closed 2-form"; } \\ & \text { condition automatically } \\ & \text { satisfied by expression above). } \end{aligned}$ | | 6. The four-dimensional manifold | Spacetime | Phase space | | 7. Coordinate-free description of the structure of this manifold | $\boldsymbol{R i e m a n n}=0$ | $\boldsymbol{d} \mathcal{O}=0$ | | 8. Canonical coordinates distinguished from other | Make metric take above form (item 2) | Make metric take above form (item 2) |
(c) Show that this five-dimensional equation can be written in the 4-dimensional phase space of { q j , p k } q j , p k {q^(j),p_(k)}\left\{q^{j}, p_{k}\right\}{qj,pk} as
O ( , d P / d t ) = d H O ( , d P / d t ) = d H O(dots,dP//dt)=dH\boldsymbol{O}(\ldots, d \mathscr{P} / d t)=\boldsymbol{d} HO(,dP/dt)=dH
where O O O\mathcal{O}O is the 2 -form defined in Box 4.5.
(d) Show that the components of O ( , d P / d t ) = d H O ( , d P / d t ) = d H O(dots,dP//dt)=dH\boldsymbol{O}(\ldots, d \mathscr{P} / d t)=\boldsymbol{d} HO(,dP/dt)=dH in the { q j , p k } q j , p k {q^(j),p_(k)}\left\{q^{j}, p_{k}\right\}{qj,pk} coordinate system are the familiar Hamilton equations. Note that this conclusion depends only on the form assumed for Θ Θ Theta\boldsymbol{\Theta}Θ, so that one also obtains the standard Hamilton equations in any other phase-space coordinates { q ¯ j , p ¯ k } q ¯ j , p ¯ k { bar(q)^(j), bar(p)_(k)}\left\{\bar{q}^{j}, \bar{p}_{k}\right\}{q¯j,p¯k} ("canonical variables") for which
O = d p ¯ 1 d q ¯ 1 + d p 2 d q ¯ 2 O = d p ¯ 1 d q ¯ 1 + d p 2 ¯ d q ¯ 2 O=d bar(p)_(1)^^d bar(q)^(1)+d bar(p_(2))^^d bar(q)^(2)\boldsymbol{O}=\boldsymbol{d} \bar{p}_{1} \wedge \boldsymbol{d} \bar{q}^{1}+\boldsymbol{d} \overline{p_{2}} \wedge \boldsymbol{d} \bar{q}^{2}O=dp¯1dq¯1+dp2dq¯2

Exercise 4.12. SYMMETRY OPERATIONS AS TENSORS

We define the meaning of square and round brackets enclosing a set of indices as follows:
V ( α 1 α p ) 1 p ! Σ V α α 1 , α π p , V [ α 1 α p ] 1 p ! Σ ( 1 ) π V α π i α π , V α 1 α p 1 p ! Σ V α α 1 , α π p , V α 1 α p 1 p ! Σ ( 1 ) π V α π i α π , V_((alpha_(1)dotsalpha_(p)))-=(1)/(p!)SigmaV_(alpha_(alpha_(1),dots)dotsalpha_(pi p)),quadV_([alpha_(1)dotsalpha_(p)])-=(1)/(p!)Sigma(-1)^(pi)V_(alpha_(pi i)dotsalpha_(pi,))V_{\left(\alpha_{1} \ldots \alpha_{p}\right)} \equiv \frac{1}{p!} \Sigma V_{\alpha_{\alpha_{1}, \ldots} \ldots \alpha_{\pi p}}, \quad V_{\left[\alpha_{1} \ldots \alpha_{p}\right]} \equiv \frac{1}{p!} \Sigma(-1)^{\pi} V_{\alpha_{\pi i} \ldots \alpha_{\pi,}}V(α1αp)1p!ΣVαα1,απp,V[α1αp]1p!Σ(1)πVαπiαπ,

Box 4.6 BIRTH OF STOKES' THEOREM

Central to the mathematical formulation of electromagnetism are the theorems of Gauss (taken up in Chapter 5) and Stokes. Both today appear together as one unity when expressed in the language of forms. In earlier times the unity was not evident. Everitt (1970) recalls the history of Stokes' theorem: "The Smith's Prize paper set by [G. C.] Stokes [Lucasian Professor of Mathematics] and taken by Maxwell in [February] 1854 ...
5. Given the centre and two points of an ellipse, and the length of the major axis, find its direction by a geometrical construction.
6. Integrate the differential equation
( a 2 x 2 ) d y 2 + 2 x y d y d x + ( a 2 y 2 ) d x 2 = 0 . a 2 x 2 d y 2 + 2 x y d y d x + a 2 y 2 d x 2 = 0 . (a^(2)-x^(2))dy^(2)+2xydydx+(a^(2)-y^(2))dx^(2)=0.\left(a^{2}-x^{2}\right) d y^{2}+2 x y d y d x+\left(a^{2}-y^{2}\right) d x^{2}=0 .(a2x2)dy2+2xydydx+(a2y2)dx2=0.
Has it a singular solution?
7. In a double system of curves of double curvature, a tangent is always drawn at the variable point P P PPP; shew that, as P P PPP moves away from an arbitrary fixed point Q Q QQQ, it must begin to move along a generating line of an elliptic cone having Q Q QQQ for vertex in order that consecutive tangents may ultimately intersect, but that the conditions of the problem may be impossible.
8. If X , Y , Z X , Y , Z X,Y,ZX, Y, ZX,Y,Z be functions of the rectangular co-ordinates x , y , z , d S x , y , z , d S x,y,z,dSx, y, z, d Sx,y,z,dS an element of any limited surface, l , m , n l , m , n l,m,nl, m, nl,m,n the cosines of the inclinations of the normal at d S d S dSd SdS to the axes, d s d s dsd sds an element of the bounding line, shew that
{ l ( d Z d y d Y d z ) + m ( d X d z d Z d x ) + n ( d Y d x d X d y ) } d S = ( X d x d s + Y d y d s + Z d z d s ) d s l d Z d y d Y d z + m d X d z d Z d x + n d Y d x d X d y d S = X d x d s + Y d y d s + Z d z d s d s {:[∬{l((dZ)/(dy)-(dY)/(dz))+m((dX)/(dz)-(dZ)/(dx))+n((dY)/(dx)-(dX)/(dy))}dS],[=int(X(dx)/(ds)+Y(dy)/(ds)+Z(dz)/(ds))ds]:}\begin{gathered} \iint\left\{l\left(\frac{d Z}{d y}-\frac{d Y}{d z}\right)+m\left(\frac{d X}{d z}-\frac{d Z}{d x}\right)+n\left(\frac{d Y}{d x}-\frac{d X}{d y}\right)\right\} d S \\ =\int\left(X \frac{d x}{d s}+Y \frac{d y}{d s}+Z \frac{d z}{d s}\right) d s \end{gathered}{l(dZdydYdz)+m(dXdzdZdx)+n(dYdxdXdy)}dS=(Xdxds+Ydyds+Zdzds)ds
the differential coefficients of X , Y , Z X , Y , Z X,Y,ZX, Y, ZX,Y,Z being partial, and the single integral being taken all round the perimeter of the surface
marks the first appearance in print of the formula connecting line and surface integrals now known as Stokes' theorem. This was of great importance to Maxwell's development of electromagnetic theory. The earliest explicit proof of the theorem appears to be that given in a letter from Thomson to Stokes dated July 2, 1850." [Quoted in Campbell and Garnett (1882), pp. 186-187.]
Here the sum is taken over all permutations π π pi\piπ of the numbers 1 , 2 , , p 1 , 2 , , p 1,2,dots,p1,2, \ldots, p1,2,,p, and ( 1 ) π ( 1 ) π (-1)^(pi)(-1)^{\pi}(1)π is +1 or -1 depending on whether the permutation is even or odd. The quantity V V VVV may have other indices, not shown here, besides the set of p p ppp indices α 1 , α 2 , , α p α 1 , α 2 , , α p alpha_(1),alpha_(2),dots,alpha_(p)\alpha_{1}, \alpha_{2}, \ldots, \alpha_{p}α1,α2,,αp, but only this set of indices is affected by the operations described here. The numbers π 1 , π 2 , , π p π 1 , π 2 , , π p pi_(1),pi_(2),dots,pi_(p)\pi_{1}, \pi_{2}, \ldots, \pi_{p}π1,π2,,πp are the numbers 1 , 2 , , p 1 , 2 , , p 1,2,dots,p1,2, \ldots, p1,2,,p rearranged according to the permutation π π pi\piπ. (Cases p = 2 , 3 p = 2 , 3 p=2,3p=2,3p=2,3 were treated in exercise 3.12.) We therefore have machinery to convert any rank-p tensor with components V α 1 α p V α 1 α p V_(alpha_(1)dotsalpha_(p))V_{\alpha_{1} \ldots \alpha_{p}}Vα1αp into a new tensor with components
[ A / t ( V ) ] μ 1 μ p = V [ μ 1 μ p ] . [ A / t ( V ) ] μ 1 μ p = V μ 1 μ p . [A//t(V)]_(mu_(1)dotsmu_(p))=V_([mu_(1)dotsmu_(p)]).[\boldsymbol{A} / \boldsymbol{t}(\boldsymbol{V})]_{\mu_{1} \ldots \mu_{p}}=V_{\left[\mu_{1} \ldots \mu_{p}\right]} .[A/t(V)]μ1μp=V[μ1μp].
Since this machinery A / t A / t A//t\boldsymbol{A} / \boldsymbol{t}A/t is linear, it can be viewed as a tensor which, given suitable arguments u , v , , w , α , β , , γ u , v , , w , α , β , , γ u,v,dots,w,alpha,beta,dots,gamma\boldsymbol{u}, \boldsymbol{v}, \ldots, \boldsymbol{w}, \boldsymbol{\alpha}, \boldsymbol{\beta}, \ldots, \boldsymbol{\gamma}u,v,,w,α,β,,γ produces a number
u μ v ν w λ α [ μ β ν γ λ ] u μ v ν w λ α [ μ β ν γ λ ] u^(mu)v^(nu)dotsw^(lambda)alpha_([mu)beta_(nu)dotsgamma_(lambda])u^{\mu} v^{\nu} \ldots w^{\lambda} \alpha_{[\mu} \beta_{\nu} \ldots \gamma_{\lambda]}uμvνwλα[μβνγλ]
(a) Show that the components of this tensor are
( A I ) β 1 β p α 1 α p = ( p ! ) 1 δ β 1 β p α 1 α p ( A I ) β 1 β p α 1 α p = ( p ! ) 1 δ β 1 β p α 1 α p (AI)_(beta_(1)dotsbeta_(p))^(alpha_(1)dotsalpha_(p))=(p!)^(-1)delta_(beta_(1)dotsbeta_(p))^(alpha_(1)dotsalpha_(p))(\boldsymbol{A} \boldsymbol{I})_{\beta_{1} \ldots \beta_{p}}{ }^{\alpha_{1} \ldots \alpha_{p}}=(p!)^{-1} \delta_{\beta_{1} \ldots \beta_{p}}^{\alpha_{1} \ldots \alpha_{p}}(AI)β1βpα1αp=(p!)1δβ1βpα1αp
(Note: indices of δ δ delta\deltaδ are almost never raised or lowered, so this notation leads to no confusion.)
where
δ β 1 β p α 1 α p = { + 1 if ( α 1 , , α p ) is an even permutation of ( β 1 , , β p ) , 1 if ( α 1 , , α p ) is an odd permutation of ( β 1 , , β p ) , 0 if (i) any two of the α 's are the same, 0 if (ii) any two of the β 's are the same, 0 if (iii) the α 's and β 's are different sets of integers. δ β 1 β p α 1 α p = + 1  if  α 1 , , α p  is an even permutation of  β 1 , , β p , 1  if  α 1 , , α p  is an odd permutation of  β 1 , , β p , 0  if (i) any two of the  α  's are the same,  0  if (ii) any two of the  β  's are the same,  0  if (iii) the  α  's and  β  's are different sets of integers.  delta_(beta_(1)dotsbeta_(p))^(alpha_(1)dotsalpha_(p))={[+1" if "(alpha_(1),dots,alpha_(p))" is an even permutation of "(beta_(1),dots,beta_(p))","],[-1" if "(alpha_(1),dots,alpha_(p))" is an odd permutation of "(beta_(1),dots,beta_(p))","],[0" if (i) any two of the "alpha" 's are the same, "],[0" if (ii) any two of the "beta" 's are the same, "],[0" if (iii) the "alpha" 's and "beta" 's are different sets of integers. "]:}\delta_{\beta_{1} \ldots \beta_{p}}^{\alpha_{1} \ldots \alpha_{p}}=\left\{\begin{array}{l} +1 \text { if }\left(\alpha_{1}, \ldots, \alpha_{p}\right) \text { is an even permutation of }\left(\beta_{1}, \ldots, \beta_{p}\right), \\ -1 \text { if }\left(\alpha_{1}, \ldots, \alpha_{p}\right) \text { is an odd permutation of }\left(\beta_{1}, \ldots, \beta_{p}\right), \\ 0 \text { if (i) any two of the } \alpha \text { 's are the same, } \\ 0 \text { if (ii) any two of the } \beta \text { 's are the same, } \\ 0 \text { if (iii) the } \alpha \text { 's and } \beta \text { 's are different sets of integers. } \end{array}\right.δβ1βpα1αp={+1 if (α1,,αp) is an even permutation of (β1,,βp),1 if (α1,,αp) is an odd permutation of (β1,,βp),0 if (i) any two of the α 's are the same, 0 if (ii) any two of the β 's are the same, 0 if (iii) the α 's and β 's are different sets of integers. 
Note that the demonstration, and therefore these component values, are correct in any frame.
(b) Show for any "alternating" (i.e., "completely antisymmetric") tensor A α 1 α p = A [ α 1 α p ] A α 1 α p = A α 1 α p A_(alpha_(1)dotsalpha_(p))=A_([alpha_(1)dotsalpha_(p)])A_{\alpha_{1} \ldots \alpha_{p}}=A_{\left[\alpha_{1} \ldots \alpha_{p}\right]}Aα1αp=A[α1αp] that
1 p ! A α 1 α p δ γ 1 γ p γ p + 1 γ p + q α 1 α p β 1 β q = α 1 < α 2 < < α p A α 1 α p δ γ 1 p γ p + q α 1 α p β 1 β q A | α 1 α p | δ γ 1 γ p + q α 1 α p β q 1 p ! A α 1 α p δ γ 1 γ p γ p + 1 γ p + q α 1 α p β 1 β q = α 1 < α 2 < < α p A α 1 α p δ γ 1 p γ p + q α 1 α p β 1 β q A α 1 α p δ γ 1 γ p + q α 1 α p β q {:[(1)/(p!)A_(alpha_(1)dotsalpha_(p))delta_(gamma_(1)dotsgamma_(p)gamma_(p+1)dotsgamma_(p+q))^(alpha_(1)dotsalpha_(p)beta_(1)dotsbeta_(q))],[=sum_(alpha_(1) < alpha_(2) < dots < alpha_(p))A_(alpha_(1)dotsalpha_(p))delta_(gamma_(1)dots dots p dotsgamma_(p+q))^(alpha_(1)dotsalpha_(p)beta_(1)dotsbeta_(q))],[-=A_(|alpha_(1)dotsalpha_(p∣)|)delta_(gamma_(1)dots dots dots dotsgamma_(p+q))^(alpha_(1)dotsalpha_(p)dotsbeta_(q))]:}\begin{aligned} \frac{1}{p!} A_{\alpha_{1} \ldots \alpha_{p}} & \delta_{\gamma_{1} \ldots \gamma_{p} \gamma_{p+1} \ldots \gamma_{p+q}}^{\alpha_{1} \ldots \alpha_{p} \beta_{1} \ldots \beta_{q}} \\ & =\sum_{\alpha_{1}<\alpha_{2}<\ldots<\alpha_{p}} A_{\alpha_{1} \ldots \alpha_{p}} \delta_{\gamma_{1} \ldots \ldots p \ldots \gamma_{p+q}}^{\alpha_{1} \ldots \alpha_{p} \beta_{1} \ldots \beta_{q}} \\ & \equiv A_{\left|\alpha_{1} \ldots \alpha_{p \mid}\right|} \delta_{\gamma_{1} \ldots \ldots \ldots \ldots \gamma_{p+q}}^{\alpha_{1} \ldots \alpha_{p} \ldots \beta_{q}} \end{aligned}1p!Aα1αpδγ1γpγp+1γp+qα1αpβ1βq=α1<α2<<αpAα1αpδγ1pγp+qα1αpβ1βqA|α1αp|δγ1γp+qα1αpβq
The final line here introduces the convention that a summation over indices enclosed between vertical bars includes only terms with those indices in increasing order. Show, consequently or similarly, that
δ γ 1 γ p + q α 1 α p β 1 β q δ | β 1 β q | μ 1 μ q = δ γ 1 γ p + q α 1 α p μ 1 μ q . δ γ 1 γ p + q α 1 α p β 1 β q δ β 1 β q μ 1 μ q = δ γ 1 γ p + q α 1 α p μ 1 μ q . delta_(gamma_(1)dots dots dots dotsgamma_(p+q))^(alpha_(1)dotsalpha_(p)beta_(1)dotsbeta_(q))delta_(|beta_(1)dotsbeta_(q)|)^(mu_(1)dotsmu_(q))=delta_(gamma_(1)dots dots dots dotsgamma_(p+q))^(alpha_(1)dotsalpha_(p)mu_(1)dotsmu_(q)).\delta_{\gamma_{1} \ldots \ldots \ldots \ldots \gamma_{p+q}}^{\alpha_{1} \ldots \alpha_{p} \beta_{1} \ldots \beta_{q}} \delta_{\left|\beta_{1} \ldots \beta_{q}\right|}^{\mu_{1} \ldots \mu_{q}}=\delta_{\gamma_{1} \ldots \ldots \ldots \ldots \gamma_{p+q}}^{\alpha_{1} \ldots \alpha_{p} \mu_{1} \ldots \mu_{q}} .δγ1γp+qα1αpβ1βqδ|β1βq|μ1μq=δγ1γp+qα1αpμ1μq.
(c) Define the exterior ("wedge") product of any two alternating tensors by
( α β ) λ 1 λ p + q = δ λ 1 λ p λ p + 1 λ p + q μ 1 μ p ν 1 ν q α | μ 1 μ p | β | ν 1 ν q | ; ( α β ) λ 1 λ p + q = δ λ 1 λ p λ p + 1 λ p + q μ 1 μ p ν 1 ν q α μ 1 μ p β ν 1 ν q ; (alpha^^beta)_(lambda_(1)dotslambda_(p+q))=delta_(lambda_(1)dotslambda_(p)lambda_(p+1)dotslambda_(p+q))^(mu_(1)dotsmu_(p)nu_(1)dotsnu_(q))alpha_(|mu_(1)dotsmu_(p)|)beta_(|nu_(1)dotsnu_(q)|);(\boldsymbol{\alpha} \wedge \boldsymbol{\beta})_{\lambda_{1} \ldots \lambda_{p+q}}=\delta_{\lambda_{1} \ldots \lambda_{p} \lambda_{p+1} \ldots \lambda_{p+q}}^{\mu_{1} \ldots \mu_{p} \nu_{1} \ldots \nu_{q}} \alpha_{\left|\mu_{1} \ldots \mu_{p}\right|} \beta_{\left|\nu_{1} \ldots \nu_{q}\right|} ;(αβ)λ1λp+q=δλ1λpλp+1λp+qμ1μpν1νqα|μ1μp|β|ν1νq|;
and similarly
( U V ) λ 1 λ p + q = δ μ 1 μ p p 1 v q λ 1 λ p λ p 1 + λ p + q U | μ 1 μ p | V | p 1 ν q | ( U V ) λ 1 λ p + q = δ μ 1 μ p p 1 v q λ 1 λ p λ p 1 + λ p + q U μ 1 μ p V p 1 ν q (U^^V)^(lambda_(1)dotslambda_(p+q))=delta_(mu_(1)dotsmu_(p)p_(1)dotsv_(q))^(lambda_(1)dotslambda_(p)lambda_(p1+)dotslambda_(p+q))U^(|mu_(1)dotsmu_(p)|)V^(|p_(1)dotsnu_(q)|)(\boldsymbol{U} \wedge \boldsymbol{V})^{\lambda_{1} \ldots \lambda_{p+q}}=\delta_{\mu_{1} \ldots \mu_{p} p_{1} \ldots v_{q}}^{\lambda_{1} \ldots \lambda_{p} \lambda_{p 1+} \ldots \lambda_{p+q}} U^{\left|\mu_{1} \ldots \mu_{p}\right|} V^{\left|p_{1} \ldots \nu_{q}\right|}(UV)λ1λp+q=δμ1μpp1vqλ1λpλp1+λp+qU|μ1μp|V|p1νq|
Show that this implies equation (3.45b). Establish the associative law for this product rule by showing that
[ ( α β ) γ ] σ 1 σ p + q + r [ ( α β ) γ ] σ 1 σ p + q + r {:{:[(alpha^^beta)^^gamma]_(sigma_(1)dotssigma_(p+q+r)):}:}\begin{aligned} & {[(\boldsymbol{\alpha} \wedge \boldsymbol{\beta}) \wedge \boldsymbol{\gamma}]_{\sigma_{1} \ldots \sigma_{p+q+r}}} \end{aligned}[(αβ)γ]σ1σp+q+r
= [ α ( β γ ) ] σ 1 σ p + α + r ; = [ α ( β γ ) ] σ 1 σ p + α + r ; {:=[alpha^^(beta^^gamma)]_(sigma_(1)dotssigma_(p+alpha+r));:}\begin{aligned} & =[\boldsymbol{\alpha} \wedge(\boldsymbol{\beta} \wedge \boldsymbol{\gamma})]_{\sigma_{1} \ldots \sigma_{p+\alpha+r}} ; \end{aligned}=[α(βγ)]σ1σp+α+r;
and show that this reduces to the 3-form version of Equation (3.45c) when α , β α , β alpha,beta\boldsymbol{\alpha}, \boldsymbol{\beta}α,β, and γ γ gamma\boldsymbol{\gamma}γ are all 1 -forms.
(d) Derive the following formula for the components of the exterior product of p p ppp vectors
( u 1 u 2 u p ) α 1 α p = δ μ ν α 1 α p ( u 1 ) μ ( u p ) v = p ! u 1 [ α 1 u 2 α 2 u p α p ] = δ 12 p α 1 α 2 α p det [ ( u μ ) λ ] . u 1 u 2 u p α 1 α p = δ μ ν α 1 α p u 1 μ u p v = p ! u 1 α 1 u 2 α 2 u p α p = δ 12 p α 1 α 2 α p det u μ λ . {:[(u_(1)^^u_(2)^^cdots^^u_(p))^(alpha_(1)dotsalpha_(p))=delta_(mu dots nu)^(alpha_(1)dotsalpha_(p))(u_(1))^(mu)dots(u_(p))^(v)],[=p!u_(1)^([alpha_(1):})u_(2)^(alpha_(2))dotsu_(p)^({:alpha_(p)])],[=delta_(12 dots p)^(alpha_(1)alpha_(2)dotsalpha_(p))det[(u_(mu))^(lambda)].]:}\begin{aligned} \left(\boldsymbol{u}_{1} \wedge \boldsymbol{u}_{2} \wedge \cdots \wedge \boldsymbol{u}_{p}\right)^{\alpha_{1} \ldots \alpha_{p}} & =\delta_{\mu \ldots \nu}^{\alpha_{1} \ldots \alpha_{p}}\left(u_{1}\right)^{\mu} \ldots\left(u_{p}\right)^{v} \\ & =p!u_{1}^{\left[\alpha_{1}\right.} u_{2}^{\alpha_{2}} \ldots u_{p}^{\left.\alpha_{p}\right]} \\ & =\delta_{12 \ldots p}^{\alpha_{1} \alpha_{2} \ldots \alpha_{p}} \operatorname{det}\left[\left(u_{\mu}\right)^{\lambda}\right] . \end{aligned}(u1u2up)α1αp=δμνα1αp(u1)μ(up)v=p!u1[α1u2α2upαp]=δ12pα1α2αpdet[(uμ)λ].

CHAPTER
5

STRESS-ENERGY TENSOR AND CONSERVATION LAWS

§5.1. TRACK-1 OVERVIEW

"Geometry tells matter how to move, and matter tells geometry how to curve." However, it will do no good to look into curvature (Part III) and Einstein's law for the production of curvature by mass-energy (Part IV) until a tool can be found to determine how much mass-energy there is in a unit volume. That tool is the stress-energy tensor. It is the focus of attention in this chapter.
The essential features of the stress-energy tensor are summarized in Box 5.1 for the benefit of readers who want to rush on into gravitation physics as quickly as
possible. Such readers can proceed directly from Box 5.1 into Chapter 6-though by doing so, they close the door on several later portions of track two, which lean
heavily on material treated in this chapter.

§5.2. THREE-DIMENSIONAL VOLUMES AND DEFINITION OF THE STRESS-ENERGY TENSOR

The rest of this chapter is

Track 2.

It depends on no preceding

Track-2 material.

It is needed as preparation for Chapter 20 (conservation laws for mass and angular momentum).
It will be extremely helpful in all applications of gravitation theory (Chapters 18-40).
Spacetime contains a flowing "river" of 4-momentum. Each particle carries its 4-momentum vector with itself along its world line. Many particles, on many world lines, viewed in a smeared-out manner (continuum approximation), produce a continuum flow-a river of 4-momentum. Electromagnetic fields, neutrino fields, meson fields: they too contribute to the river.
How can the flow of the river be quantified? By means of a linear machine: the stress-energy tensor T T T\boldsymbol{T}T.
Choose a small, three-dimensional parallelepiped in spacetime with vectors A A A\boldsymbol{A}A, B , C B , C B,C\boldsymbol{B}, \boldsymbol{C}B,C for edges (Figure 5.1). Ask how much 4 -momentum crosses that volume in

Box 5.1 CHAPTER 5 SUMMARIZED

A. STRESS-ENERGY TENSOR AS A MACHINE

At each event in spacetime, there exists a stress-energy tensor. It is a machine that contains a knowledge of the energy density, momentum density, and stress as measured by any and all observers at that event. Included are energy, momentum, and stress associated with all forms of matter and all nongravitational fields.
The stress-energy tensor is a linear, symmetric machine with two slots for the insertion of two vectors: T ( , ) T ( , ) T(dots,dots)\boldsymbol{T}(\ldots, \ldots)T(,). Its output, for given input, can be summarized as follows.
(1) Insert the 4-velocity u u u\boldsymbol{u}u of an observer into one of the slots; leave the other slot empty. The output is
T ( u , ) = T ( , u ) = ( density of 4-momentum, "d d p / d V , " i.e., 4-momentum per unit of three-dimensional volume, as measured in observer's Lorentz frame at event where T is chosen ) ; T ( u , ) = T ( , u ) =  density of 4-momentum,   "d  d p / d V , "  i.e., 4-momentum   per unit of three-dimensional volume,   as measured in observer's   Lorentz frame at event where  T  is chosen  ; T(u,dots)=T(dots,u)=-([" density of 4-momentum, "],[" "d "dp//dV",""" i.e., 4-momentum "],[" per unit of three-dimensional volume, "],[" as measured in observer's "],[" Lorentz frame at event where "],[T" is chosen "]);\boldsymbol{T}(\boldsymbol{u}, \ldots)=\boldsymbol{T}(\ldots, \boldsymbol{u})=-\left(\begin{array}{l} \text { density of 4-momentum, } \\ \text { "d } d \boldsymbol{p} / d V, " \text { i.e., 4-momentum } \\ \text { per unit of three-dimensional volume, } \\ \text { as measured in observer's } \\ \text { Lorentz frame at event where } \\ \boldsymbol{T} \text { is chosen } \end{array}\right) ;T(u,)=T(,u)=( density of 4-momentum,  "d dp/dV," i.e., 4-momentum  per unit of three-dimensional volume,  as measured in observer's  Lorentz frame at event where T is chosen );
i.e., T α β u β = T β α u β = ( d p α / d V ) T α β u β = T β α u β = d p α / d V T^(alpha)_(beta)u^(beta)=T_(beta)^(alpha)u^(beta)=-(dp^(alpha)//dV)T^{\alpha}{ }_{\beta} u^{\beta}=T_{\beta}{ }^{\alpha} u^{\beta}=-\left(d p^{\alpha} / d V\right)Tαβuβ=Tβαuβ=(dpα/dV) for observer with 4 -velocity u α u α u^(alpha)u^{\alpha}uα.
(2) Insert 4-velocity of observer into one slot; insert an arbitrary unit vector n n n\boldsymbol{n}n into the other slot. The output is
T ( u , n ) = T ( n , u ) = ( component, " n d p / d V ", of 4 -momentum density along the n direction, as measured in observer's Lorentz frame ) T ( u , n ) = T ( n , u ) =  component, "  n d p / d V  ", of  4 -momentum density along the  n  direction, as measured in   observer's Lorentz frame  T(u,n)=T(n,u)=-([" component, " "n*dp//dV" ", of "],[4"-momentum density along the "],[n" direction, as measured in "],[" observer's Lorentz frame "])\boldsymbol{T}(\boldsymbol{u}, \boldsymbol{n})=\boldsymbol{T}(\boldsymbol{n}, \boldsymbol{u})=-\left(\begin{array}{l} \text { component, " } \boldsymbol{n} \cdot d \boldsymbol{p} / d V \text { ", of } \\ 4 \text {-momentum density along the } \\ \boldsymbol{n} \text { direction, as measured in } \\ \text { observer's Lorentz frame } \end{array}\right)T(u,n)=T(n,u)=( component, " ndp/dV ", of 4-momentum density along the n direction, as measured in  observer's Lorentz frame )
i.e., T α β u α n β = T α β n α u β = n μ d p μ / d V T α β u α n β = T α β n α u β = n μ d p μ / d V T_(alpha beta)u^(alpha)n^(beta)=T_(alpha beta)n^(alpha)u^(beta)=-n_(mu)dp^(mu)//dVT_{\alpha \beta} u^{\alpha} n^{\beta}=T_{\alpha \beta} n^{\alpha} u^{\beta}=-n_{\mu} d p^{\mu} / d VTαβuαnβ=Tαβnαuβ=nμdpμ/dV.
(3) Insert 4-velocity of observer into both slots. The output is the density of massenergy that he measures in his Lorentz frame:
T ( u , u ) = ( mass-energy per unit volume as measured in frame with 4 -velocity u ) T ( u , u ) = (  mass-energy per unit volume as measured   in frame with  4 -velocity  u ) T(u,u)=((" mass-energy per unit volume as measured ")/(" in frame with "4"-velocity "u))\boldsymbol{T}(\boldsymbol{u}, \boldsymbol{u})=\binom{\text { mass-energy per unit volume as measured }}{\text { in frame with } 4 \text {-velocity } \boldsymbol{u}}T(u,u)=( mass-energy per unit volume as measured  in frame with 4-velocity u)
(4) Pick an observer and choose two spacelike basis vectors, e j e j e_(j)\boldsymbol{e}_{j}ej and e k e k e_(k)\boldsymbol{e}_{k}ek, of his Lorentz frame. Insert e j e j e_(j)\boldsymbol{e}_{j}ej and e k e k e_(k)\boldsymbol{e}_{k}ek into the slots of T T T\boldsymbol{T}T. The output is the j , k j , k j,kj, kj,k component of the stress as measured by that observer:
T j k = T ( e j , e k ) = T k j = T ( e k , e j ) T j k = T e j , e k = T k j = T e k , e j T_(jk)=T(e_(j),e_(k))=T_(kj)=T(e_(k),e_(j))T_{j k}=\boldsymbol{T}\left(\boldsymbol{e}_{j}, \boldsymbol{e}_{k}\right)=\boldsymbol{T}_{k j}=\boldsymbol{T}\left(\boldsymbol{e}_{k}, \boldsymbol{e}_{j}\right)Tjk=T(ej,ek)=Tkj=T(ek,ej)
= ( j -component of force acting from side x k ε to side x k + ε , across a unit surface area with perpendicular direction e k ) = ( k -component of force acting from side x j ε to side x j + ε , across a unit surface area with perpendicular direction e j ) . = j -component of force acting   from side  x k ε  to side  x k + ε ,  across a unit surface area with   perpendicular direction  e k = k -component of force acting   from side  x j ε  to side  x j + ε ,  across a unit surface area with   perpendicular direction  e j . =([j"-component of force acting "],[" from side "x^(k)-epsi" to side "x^(k)+epsi","],[" across a unit surface area with "],[" perpendicular direction "e_(k)])=([k"-component of force acting "],[" from side "x^(j)-epsi" to side "x^(j)+epsi","],[" across a unit surface area with "],[" perpendicular direction "e_(j)]).=\left(\begin{array}{l} j \text {-component of force acting } \\ \text { from side } x^{k}-\varepsilon \text { to side } x^{k}+\varepsilon, \\ \text { across a unit surface area with } \\ \text { perpendicular direction } \boldsymbol{e}_{k} \end{array}\right)=\left(\begin{array}{l} k \text {-component of force acting } \\ \text { from side } x^{j}-\varepsilon \text { to side } x^{j}+\varepsilon, \\ \text { across a unit surface area with } \\ \text { perpendicular direction } \boldsymbol{e}_{j} \end{array}\right) .=(j-component of force acting  from side xkε to side xk+ε, across a unit surface area with  perpendicular direction ek)=(k-component of force acting  from side xjε to side xj+ε, across a unit surface area with  perpendicular direction ej).

Box 5.1 (continued)

B. STRESS-ENERGY TENSOR FOR A PERFECT FLUID

One type of matter studied extensively later in this book is a "perfect fluid." A perfect fluid is a fluid or gas that (1) moves through spacetime with a 4 -velocity u u u\boldsymbol{u}u which may vary from event to event, and (2) exhibits a density of mass-energy ρ ρ rho\rhoρ and an isotropic pressure p p ppp in the rest frame of each fluid element. Shear stresses, anisotropic pressures, and viscosity must be absent, or the fluid is not perfect. The stress-energy tensor for a perfect fluid at a given event can be constructed from the metric tensor, g g g\boldsymbol{g}g, the 4 -velocity, u u u\boldsymbol{u}u, and the rest-frame density and pressure, ρ ρ rho\rhoρ and p p ppp :
T = ( ρ + p ) u u + p g , or T α β = ( ρ + p ) u α u β + p g α β T = ( ρ + p ) u u + p g ,  or  T α β = ( ρ + p ) u α u β + p g α β T=(rho+p)u ox u+pg,quad" or "T_(alpha beta)=(rho+p)u_(alpha)u_(beta)+pg_(alpha beta)\boldsymbol{T}=(\rho+p) \boldsymbol{u} \otimes \boldsymbol{u}+p \boldsymbol{g}, \quad \text { or } T_{\alpha \beta}=(\rho+p) u_{\alpha} u_{\beta}+p g_{\alpha \beta}T=(ρ+p)uu+pg, or Tαβ=(ρ+p)uαuβ+pgαβ
In the fluid's rest frame, the components of this stress-energy tensor have the expected form (insert into a slot of T T T\boldsymbol{T}T, as 4-velocity of observer, just the fluid's 4 -velocity):
T α β u β = [ ( ρ + p ) u α u β + p δ α β ] u β = ( ρ + p ) u α + p u α = ρ u α ; T α β u β = ( ρ + p ) u α u β + p δ α β u β = ( ρ + p ) u α + p u α = ρ u α ; T^(alpha)_(beta)u^(beta)=[(rho+p)u^(alpha)u_(beta)+pdelta^(alpha)_(beta)]u^(beta)=-(rho+p)u^(alpha)+pu^(alpha)=-rhou^(alpha);T^{\alpha}{ }_{\beta} u^{\beta}=\left[(\rho+p) u^{\alpha} u_{\beta}+p \delta^{\alpha}{ }_{\beta}\right] u^{\beta}=-(\rho+p) u^{\alpha}+p u^{\alpha}=-\rho u^{\alpha} ;Tαβuβ=[(ρ+p)uαuβ+pδαβ]uβ=(ρ+p)uα+puα=ρuα;
i.e.,
T 0 β u β = ρ = ( mass-energy density ) = d p 0 / d V , T β j u β = 0 = ( momentum density ) = d p j / d V T 0 β u β = ρ = (  mass-energy density  ) = d p 0 / d V , T β j u β = 0 = (  momentum density  ) = d p j / d V {:[T^(0)_(beta)u^(beta)=-rho=-(" mass-energy density ")=-dp^(0)//dV","],[T_(_(beta))^(j)u^(beta)=0=-(" momentum density ")=-dp^(j)//dV]:}\begin{aligned} & T^{0}{ }_{\beta} u^{\beta}=-\rho=-(\text { mass-energy density })=-d p^{0} / d V, \\ & T_{{ }_{\beta}}^{j} u^{\beta}=0=-(\text { momentum density })=-d p^{j} / d V \end{aligned}T0βuβ=ρ=( mass-energy density )=dp0/dV,Tβjuβ=0=( momentum density )=dpj/dV
also
T j k = T ( e j , e k ) = p δ j k = stress-tensor components. T j k = T e j , e k = p δ j k =  stress-tensor components.  T_(jk)=T(e_(j),e_(k))=pdelta_(jk)=" stress-tensor components. "T_{j k}=\boldsymbol{T}\left(\boldsymbol{e}_{j}, \boldsymbol{e}_{k}\right)=p \delta_{j k}=\text { stress-tensor components. }Tjk=T(ej,ek)=pδjk= stress-tensor components. 

C. CONSERVATION OF ENERGY-MOMENTUM

In electrodynamics the conservation of charge can be expressed by the differential equation
( charge density ) / t + ( current density ) = 0 (  charge density  ) / t + (  current density  ) = 0 del(" charge density ")//del t+grad*(" current density ")=0\partial(\text { charge density }) / \partial t+\boldsymbol{\nabla} \cdot(\text { current density })=0( charge density )/t+( current density )=0
i.e., J 0 , 0 + J = 0 J 0 , 0 + J = 0 J^(0)_(,0)+grad*J=0J^{0}{ }_{, 0}+\boldsymbol{\nabla} \cdot \boldsymbol{J}=0J0,0+J=0; i.e. J α , α = 0 J α , α = 0 J^(alpha)_(,alpha)=0J^{\alpha}{ }_{, \alpha}=0Jα,α=0; i.e., J = 0 J = 0 grad*J=0\boldsymbol{\nabla} \cdot \boldsymbol{J}=0J=0. Similarly, conservation of energy-momentum can be expressed by the fundamental geometric law
T = 0 T = 0 grad*T=0\boldsymbol{\nabla} \cdot \boldsymbol{T}=0T=0
(Because T T T\boldsymbol{T}T is symmetric, it does not matter on which slot the divergence is taken.) This law plays an important role in gravitation theory.
Figure 5.1.
The "river" of 4 -momentum flowing through spacetime, and three different 3 -volumes across which it flows. (One dimension is suppressed from the picture; so the 3 -volumes look like 2 -volumes.) The first 3 -volume is the interior of a cubical soap box momentarily at rest in the depicted Lorentz frame. Its edges are L e x , L e y , L e z L e x , L e y , L e z Le_(x),Le_(y),Le_(z)L \boldsymbol{e}_{x}, L \boldsymbol{e}_{y}, L \boldsymbol{e}_{z}Lex,Ley,Lez; and its volume 1 -form, with "positive" sense toward future ("standard orientation"), is Σ = L 3 d t = V u ( V = L 3 = Σ = L 3 d t = V u V = L 3 = Sigma=L^(3)dt=-Vu(V=L^(3)=:}\boldsymbol{\Sigma}=L^{3} \boldsymbol{d} t=-V \boldsymbol{u}\left(V=L^{3}=\right.Σ=L3dt=Vu(V=L3= volume as measured in rest frame; u = d t = 4 u = d t = 4 u=-dt=4\boldsymbol{u}=-\boldsymbol{d} t=4u=dt=4-velocity of box). The second 3 -volume is the "world sheet" swept out in time Δ τ Δ τ Delta tau\Delta \tauΔτ by the top of a second cubical box. The box top's edges are L e x L e x Le_(x)L \boldsymbol{e}_{x}Lex and L e z L e z Le_(z)L \boldsymbol{e}_{z}Lez; and its volume 1 -form, with "positive" sense away from the box's interior, in direction of increasing y y yyy, is Σ = L 2 Δ τ d y = a Δ τ σ Σ = L 2 Δ τ d y = a Δ τ σ Sigma=L^(2)Delta tau dy=a Delta tau sigma\boldsymbol{\Sigma}=L^{2} \Delta \tau \boldsymbol{d} y=a \Delta \tau \boldsymbol{\sigma}Σ=L2Δτdy=aΔτσ ( a = L 2 = a = L 2 = a=L^(2)=a=L^{2}=a=L2= area of box top; σ = d y = σ = d y = sigma=dy=\boldsymbol{\sigma}=\boldsymbol{d} y=σ=dy= unit 1 -form containing world tube). The third 3 -volume is an arbitrary one, with edges A A A\boldsymbol{A}A, B , C B , C B,C\boldsymbol{B}, \boldsymbol{C}B,C and volume 1-form Σ μ = ϵ μ α β γ A α B β C γ Σ μ = ϵ μ α β γ A α B β C γ Sigma_(mu)=epsilon_(mu alpha beta gamma)A^(alpha)B^(beta)C^(gamma)\Sigma_{\mu}=\epsilon_{\mu \alpha \beta \gamma} A^{\alpha} B^{\beta} C^{\gamma}Σμ=ϵμαβγAαBβCγ.
its positive sense (i.e., from its "negative side" toward its "positive side"). To calculate the answer: (1) Construct the "volume 1-form"
(5.1) Σ μ = + ϵ μ α β γ A α B β C γ (5.1) Σ μ = + ϵ μ α β γ A α B β C γ {:(5.1)Sigma_(mu)=+epsilon_(mu alpha beta gamma)A^(alpha)B^(beta)C^(gamma):}\begin{equation*} \Sigma_{\mu}=+\epsilon_{\mu \alpha \beta \gamma} A^{\alpha} B^{\beta} C^{\gamma} \tag{5.1} \end{equation*}(5.1)Σμ=+ϵμαβγAαBβCγ
the parallelepiped lies in one of the 1 -form surfaces, and the positive sense across the parallelepiped is defined to be the positive sense of the 1 -form Σ Σ Sigma\boldsymbol{\Sigma}Σ. (2) Insert this volume 1 -form into the second slot of the stress-energy tensor T T T\boldsymbol{T}T. The result is
(5.2) T ( , Σ ) = p = ( momentum crossing from negative side toward positive side ) . empty slot (5.2) T ( , Σ ) = p = (  momentum crossing from   negative side toward positive side  ) .  empty   slot  {:[(5.2)T(dots","Sigma)=p=((" momentum crossing from ")/(" negative side toward positive side ")).],[" empty "uarr],[" slot "]:}\begin{align*} & \boldsymbol{T}(\ldots, \boldsymbol{\Sigma})=\boldsymbol{p}=\binom{\text { momentum crossing from }}{\text { negative side toward positive side }} . \tag{5.2}\\ & \text { empty } \uparrow \\ & \text { slot } \end{align*}(5.2)T(,Σ)=p=( momentum crossing from  negative side toward positive side ). empty  slot 
(3) To get the projection of the 4-momentum along a vector w w w\boldsymbol{w}w or 1 -form α α alpha\boldsymbol{\alpha}α, insert the volume 1-form Σ Σ Sigma\boldsymbol{\Sigma}Σ into the second slot and w w w\boldsymbol{w}w or α α alpha\boldsymbol{\alpha}α into the first:
(5.3) T ( w , Σ ) = w p , T ( α , Σ ) = α , p (5.3) T ( w , Σ ) = w p , T ( α , Σ ) = α , p {:(5.3)T(w","Sigma)=w*p","quad T(alpha","Sigma)=(:alpha","p:):}\begin{equation*} \boldsymbol{T}(\boldsymbol{w}, \boldsymbol{\Sigma})=\boldsymbol{w} \cdot \boldsymbol{p}, \quad \boldsymbol{T}(\boldsymbol{\alpha}, \boldsymbol{\Sigma})=\langle\boldsymbol{\alpha}, \boldsymbol{p}\rangle \tag{5.3} \end{equation*}(5.3)T(w,Σ)=wp,T(α,Σ)=α,p
This defines the stress-energy tensor.
Mathematical representation of 3 -volumes
Momentum crossing a 3 -volume calculated, using stress-energy tensor
The key features of 3 -volumes and the stress-energy tensor are encapsulated by the above three-step procedure. But encapsulation is not sufficient; deep understanding is also required. To gain it, one must study special cases, both of 3 -volumes and of the operation of the stress-energy machinery.

A Special Case

Interior of a soap box:
Its volume 1-form
Its 4-momentum content
Its energy density
A soap box moves through spacetime. A man at an event P 0 P 0 P_(0)\mathscr{P}_{0}P0 on the box's world line peers inside it, and examines all the soap, air, and electromagnetic fields it contains. He adds up all their 4-momenta to get a grand total p boxat 9 0 p boxat  9 0 p_("boxat "9_(0))\boldsymbol{p}_{\text {boxat } 9_{0}}pboxat 90. How much is this grand total? One can calculate it by noting that the 4 -momentum inside the box at P 0 P 0 P_(0)\mathscr{P}_{0}P0 is precisely the 4 -momentum crossing the box from past toward future there (Figure 5.1). Hence, the 4 -momentum the man measures is
(5.4) p box at Q 0 = T ( , Σ ) (5.4) p box at  Q 0 = T ( , Σ ) {:(5.4)p_("box at "Q_(0))=T(dots","Sigma):}\begin{equation*} \boldsymbol{p}_{\text {box at } \mathscr{Q}_{0}}=\boldsymbol{T}(\ldots, \boldsymbol{\Sigma}) \tag{5.4} \end{equation*}(5.4)pbox at Q0=T(,Σ)
where Σ Σ Sigma\boldsymbol{\Sigma}Σ is the box's volume 1 -form at P 0 P 0 P_(0)\mathscr{\mathscr { P }}_{0}P0. But for such a soap box, Σ Σ Sigma\boldsymbol{\Sigma}Σ has a magnitude equal to the box's volume V V VVV as measured by a man in its momentary rest frame, and the box itself lies in one of the hyperplanes of Σ Σ Sigma\boldsymbol{\Sigma}Σ; equivalently,
(5.5) Σ = V u (5.5) Σ = V u {:(5.5)Sigma=-Vu:}\begin{equation*} \boldsymbol{\Sigma}=-V \boldsymbol{u} \tag{5.5} \end{equation*}(5.5)Σ=Vu
where u u u\boldsymbol{u}u is the soap box's 4 -velocity at P 0 P 0 P_(0)\mathscr{P}_{0}P0 (minus sign because u u u\boldsymbol{u}u, regarded as a 1 -form, has positive sense toward the past, u 0 < 0 u 0 < 0 u_(0) < 0u_{0}<0u0<0 ); see Box 5.2. Hence, the total 4 -momentum inside the box is
(5.6) p boxat φ 0 = T ( , V u ) = V T ( , u ) (5.6) p boxat  φ 0 = T ( , V u ) = V T ( , u ) {:(5.6)p_("boxat "varphi_(0))=T(dots","-Vu)=-VT(dots","u):}\begin{equation*} \boldsymbol{p}_{\text {boxat } \mathscr{\varphi}_{0}}=\boldsymbol{T}(\ldots,-V \boldsymbol{u})=-V \boldsymbol{T}(\ldots, \boldsymbol{u}) \tag{5.6} \end{equation*}(5.6)pboxat φ0=T(,Vu)=VT(,u)
or, in component notation,
(5.6') ( p α ) box at g 0 = V T α β u β (5.6') p α box at  g 0 = V T α β u β {:(5.6')(p^(alpha))_("box at "g_(0))=-VT^(alpha beta)u_(beta):}\begin{equation*} \left(p^{\alpha}\right)_{\text {box at } \mathscr{g}_{0}}=-V T^{\alpha \beta} u_{\beta} \tag{5.6'} \end{equation*}(5.6')(pα)box at g0=VTαβuβ
The energy in the box, as measured in its rest frame, is minus the projection of the 4 -momentum on the box's 4 -velocity:
E = u p boxat P 0 = + V T α β u α u β = V T ( u , u ) E = u p boxat P 0 = + V T α β u α u β = V T ( u , u ) E=-u*p_(boxatP_(0))=+VT^(alpha beta)u_(alpha)u_(beta)=VT(u,u)E=-\boldsymbol{u} \cdot \boldsymbol{p}_{\mathrm{boxat} \mathscr{P}_{0}}=+V T^{\alpha \beta} u_{\alpha} u_{\beta}=V \boldsymbol{T}(\boldsymbol{u}, \boldsymbol{u})E=upboxatP0=+VTαβuαuβ=VT(u,u)
so
(5.7) energy density as ( measured in box's rest frame ) = E V = T ( u , u ) . (5.7)  energy density as  (  measured in box's   rest frame  ) = E V = T ( u , u ) . {:[(5.7)" energy density as "],[((" measured in box's ")/(" rest frame "))=(E)/(V)=T(u","u).]:}\begin{align*} & \text { energy density as } \tag{5.7}\\ & \binom{\text { measured in box's }}{\text { rest frame }}=\frac{E}{V}=\boldsymbol{T}(\boldsymbol{u}, \boldsymbol{u}) . \end{align*}(5.7) energy density as ( measured in box's  rest frame )=EV=T(u,u).

Another Special Case

A man riding with the same soap box opens its top and pours out some soap. In a very small interval of time Δ τ Δ τ Delta tau\Delta \tauΔτ, how much total 4-momentum flows out of the box?

Box 5.2 THREE-DIMENSIONAL VOLUMES

A. General Parallelepiped

  1. Edges of parallelepiped are three vectors A A A\boldsymbol{A}A, B , C B , C B,C\boldsymbol{B}, \boldsymbol{C}B,C. One must order the edges; e.g., " A A A\boldsymbol{A}A is followed by B B B\boldsymbol{B}B is followed by C C C\boldsymbol{C}C."

    (One dimension, that orthogonal to the parallelepiped, is suppressed here.)
  2. Volume trivector is defined to be A B C A B C A^^B^^C\boldsymbol{A} \wedge \boldsymbol{B} \wedge \boldsymbol{C}ABC. It enters into the sophisticated theory of volumes (Chapter 4), but is not used much in the elementary theory.
  3. Volume 1-form is defined by Σ μ = ϵ μ α β γ A α B β C γ Σ μ = ϵ μ α β γ A α B β C γ Sigma_(mu)=epsilon_(mu alpha beta gamma)A^(alpha)B^(beta)C^(gamma)\Sigma_{\mu}=\epsilon_{\mu \alpha \beta \gamma} A^{\alpha} B^{\beta} C^{\gamma}Σμ=ϵμαβγAαBβCγ. ( A , B , C A , B , C A,B,C\boldsymbol{A}, \boldsymbol{B}, \boldsymbol{C}A,B,C must appear here in standard order as chosen in step 1.) Note that the vector "corresponding" to Σ Σ Sigma\boldsymbol{\Sigma}Σ and the volume trivector are related by Σ = ( A B C ) Σ = ( A B C ) Sigma=-^(**)(A^^B^^C)\boldsymbol{\Sigma}=-{ }^{*}(\boldsymbol{A} \wedge \boldsymbol{B} \wedge \boldsymbol{C})Σ=(ABC).
  4. Orientation of the volume is defined to agree with the orientation of its 1 -form Σ Σ Sigma\boldsymbol{\Sigma}Σ. More specifically: the edges A , B , C A , B , C A,B,C\boldsymbol{A}, \boldsymbol{B}, \boldsymbol{C}A,B,C lie in a hyperplane of Σ ( Σ , A = Σ , B = Σ , C = 0 Σ ( Σ , A = Σ , B = Σ , C = 0 Sigma((:Sigma,A:)=(:Sigma,B:)=(:Sigma,C:)=0\boldsymbol{\Sigma}(\langle\boldsymbol{\Sigma}, \boldsymbol{A}\rangle=\langle\boldsymbol{\Sigma}, \boldsymbol{B}\rangle=\langle\boldsymbol{\Sigma}, \boldsymbol{C}\rangle=0Σ(Σ,A=Σ,B=Σ,C=0; no "bongs of bell"). Thus, the volume itself is one of Σ Σ Sigma\boldsymbol{\Sigma}Σ 's hyperplanes! The positive sense moving away from the volume is defined to be the positive sense of Σ Σ Sigma\boldsymbol{\Sigma}Σ. Note: reversing the order of A , B , C A , B , C A,B,C\boldsymbol{A}, \boldsymbol{B}, \boldsymbol{C}A,B,C reverses the positive

    (One dimension, that along which C C C\boldsymbol{C}C extends, is suppressed here.) sense!
  5. The "standard orientation" for a spacelike 3-volume has the positive sense of the 1-form Σ Σ Sigma\boldsymbol{\Sigma}Σ toward the future, corresponding to A , B , C A , B , C A,B,C\boldsymbol{A}, \boldsymbol{B}, \boldsymbol{C}A,B,C forming a righthanded triad of vectors.
    B. 3-Volumes of Arbitrary Shape
Can be analyzed by being broken up into union of parallelepipeds.

C. Interior of a Soap Box (Example)

  1. Analysis in soap box's rest frame. Pick an event on the box's world line. The box's three edges there are three specific vectors A , B , C A , B , C A,B,C\boldsymbol{A}, \boldsymbol{B}, \boldsymbol{C}A,B,C. In the box's rest frame they are purely spatial: A 0 = B 0 = C 0 = 0 A 0 = B 0 = C 0 = 0 A^(0)=B^(0)=C^(0)=0A^{0}=B^{0}=C^{0}=0A0=B0=C0=0. Hence, the volume 1-form has components Σ j = 0 Σ j = 0 Sigma_(j)=0\Sigma_{j}=0Σj=0 and
    Σ 0 = ϵ 0 i j k A i B j C k = det A 1 A 2 A 3 B 1 B 2 B 3 C 1 C 2 C 3 Σ 0 = ϵ 0 i j k A i B j C k = det A 1 A 2 A 3 B 1 B 2 B 3 C 1 C 2 C 3 Sigma_(0)=epsilon_(0ijk)A^(i)B^(j)C^(k)=det||[A^(1),A^(2),A^(3)],[B^(1),B^(2),B^(3)],[C^(1),C^(2),C^(3)]||\Sigma_{0}=\epsilon_{0 i j k} A^{i} B^{j} C^{k}=\operatorname{det}\left\|\begin{array}{lll}A^{1} & A^{2} & A^{3} \\ B^{1} & B^{2} & B^{3} \\ C^{1} & C^{2} & C^{3}\end{array}\right\|Σ0=ϵ0ijkAiBjCk=detA1A2A3B1B2B3C1C2C3
    = A ( B × C ) = A ( B × C ) =A*(B xx C)=\boldsymbol{A} \cdot(\boldsymbol{B} \times \boldsymbol{C})=A(B×C), in the standard notation of 3-dimensional vector analysis;
    = + V ( V = = + V ( V = =+V(V==+V(V==+V(V= volume of box) if ( A , B , C ) ( A , B , C ) (A,B,C)(\boldsymbol{A}, \boldsymbol{B}, \boldsymbol{C})(A,B,C) are righthand ordered (positive sense of Σ Σ Sigma\boldsymbol{\Sigma}Σ toward future; standard orientation);
    = V ( V = = V ( V = =-V(V==-V(V==V(V= volume of box ) ) ))) if ( A , B , C ) ( A , B , C ) (A,B,C)(\boldsymbol{A}, \boldsymbol{B}, \boldsymbol{C})(A,B,C) are lefthand ordered (positive sense of Σ Σ Sigma\boldsymbol{\Sigma}Σ toward past).
  2. This result reexpressed in geometric language: Let u u u\boldsymbol{u}u be the box's 4-velocity and V V VVV be its volume, as measured in its rest frame. Then either
    Σ = V u Σ = V u Sigma=-Vu\boldsymbol{\Sigma}=-V \boldsymbol{u}Σ=Vu, in which case the "positive side" of the box's 3 -surface is the future side, and its edges are ordered in a righthanded manner-the standard orientation;
    or else
    Σ = + V u Σ = + V u Sigma=+Vu\boldsymbol{\Sigma}=+V \boldsymbol{u}Σ=+Vu, in which case the "positive side" is the past side, and the box's edges are ordered in a lefthanded manner.

D. 3-Volume Swept Out in Time Δ τ Δ τ Delta tau\Delta \tauΔτ by Two-Dimensional Top of a Soap Box (Example)

  1. Analysis in box's rest frame: Pick an event on box's world line. There the two edges of the box top are vectors A A A\boldsymbol{A}A and B B B\boldsymbol{B}B. In the box's rest frame, orient the space axes so that A A A\boldsymbol{A}A and B B B\boldsymbol{B}B lie in the y , z y , z y,zy, zy,z-plane. During the lapse of a proper time Δ τ Δ τ Delta tau\Delta \tauΔτ, the box top sweeps out
    a 3 -volume whose third edge is u Δ τ ( u = u Δ τ ( u = u Delta tau(u=\boldsymbol{u} \Delta \tau(\boldsymbol{u}=uΔτ(u= 4 -velocity of box). In the box's rest-frame, with ordering " A A A\boldsymbol{A}A followed by B B B\boldsymbol{B}B followed by u Δ τ u Δ τ u Delta tau\boldsymbol{u} \Delta \tauuΔτ, " the volume 1 -form has components
    Σ 0 = Σ 2 = Σ 3 = 0 Σ 0 = Σ 2 = Σ 3 = 0 Sigma_(0)=Sigma_(2)=Sigma_(3)=0\Sigma_{0}=\Sigma_{2}=\Sigma_{3}=0Σ0=Σ2=Σ3=0, and
    Σ 1 = ϵ 1 j k 0 A j B k Δ τ u 0 = ϵ 01 j k A j B k Δ τ Σ 1 = ϵ 1 j k 0 A j B k Δ τ u 0 = ϵ 01 j k A j B k Δ τ Sigma_(1)=epsilon_(1jk0)A^(j)B^(k)Delta tauu^(0)=-epsilon_(01 jk)A^(j)B^(k)Delta tau\Sigma_{1}=\epsilon_{1 j k 0} A^{j} B^{k} \Delta \tau u^{0}=-\epsilon_{01 j k} A^{j} B^{k} \Delta \tauΣ1=ϵ1jk0AjBkΔτu0=ϵ01jkAjBkΔτ
    = a Δ τ ( a = = a Δ τ ( a = =-a Delta tau(a==-a \Delta \tau(a==aΔτ(a= area of box top ) ) ))) if ( e x , A , B ) e x , A , B (e_(x),A,B)\left(\boldsymbol{e}_{x}, \boldsymbol{A}, \boldsymbol{B}\right)(ex,A,B) are righthand ordered
    = + a Δ τ ( a = = + a Δ τ ( a = =+a Delta tau(a==+a \Delta \tau(a==+aΔτ(a= area of box top ) ) ))) if ( e x , A , B ) e x , A , B (e_(x),A,B)\left(\boldsymbol{e}_{x}, \boldsymbol{A}, \boldsymbol{B}\right)(ex,A,B) are lefthand ordered.
    (Note: No standard orientation can be defined in this case, because Σ Σ Sigma\boldsymbol{\Sigma}Σ can be carried continuously into Σ Σ -Sigma-\boldsymbol{\Sigma}Σ by purely spatial rotations.)
  2. This result reexpressed in geometric language: Let a a aaa be the area of the box top as measured in its rest frame; and let σ σ sigma\sigmaσ be a unit 1-form, one of whose surfaces contains the box top and its 4 -velocity (i.e., contains the box top's "world sheet"). Orient the positive sense of σ σ sigma\sigmaσ with the (arbitrarily chosen) positive sense of the box-top 3 -volume. Then
Σ = a Δ τ σ Σ = a Δ τ σ Sigma=a Delta tau sigma\boldsymbol{\Sigma}=a \Delta \tau \sigmaΣ=aΔτσ
To answer this question, consider the three-dimensional volume swept out during Δ τ Δ τ Delta tau\Delta \tauΔτ by the box's opened two-dimensional top ("world sheet of top"). The 4-momentum asked for is the 4 -momentum that crosses this world sheet in the positive sense (see Figure 5.1); hence, it is
(5.8) p flows out = T ( , Σ ) , (5.8) p flows out  = T ( , Σ ) , {:(5.8)p_("flows out ")=T(dots","Sigma)",":}\begin{equation*} \boldsymbol{p}_{\text {flows out }}=\boldsymbol{T}(\ldots, \boldsymbol{\Sigma}), \tag{5.8} \end{equation*}(5.8)pflows out =T(,Σ),
where Σ Σ Sigma\boldsymbol{\Sigma}Σ is the world sheet's volume 1 -form. Let a a aaa be the area of the box top, and σ σ sigma\sigmaσ be the outward-oriented unit 1-form, whose surfaces contain the world sheet (i.e., contain the box top and its momentary 4 -velocity vector). Then
(5.9) Σ = a Δ τ σ (5.9) Σ = a Δ τ σ {:(5.9)Sigma=a Delta tau sigma:}\begin{equation*} \boldsymbol{\Sigma}=a \Delta \tau \boldsymbol{\sigma} \tag{5.9} \end{equation*}(5.9)Σ=aΔτσ
(see Box 5.2 ); so the 4 -momentum that flows out during Δ τ Δ τ Delta tau\Delta \tauΔτ is
(5.10) p flows out = a Δ τ T ( , σ ) (5.10) p flows out  = a Δ τ T ( , σ ) {:(5.10)p_("flows out ")=a Delta tau T(dots","sigma):}\begin{equation*} \boldsymbol{p}_{\text {flows out }}=a \Delta \tau \boldsymbol{T}(\ldots, \boldsymbol{\sigma}) \tag{5.10} \end{equation*}(5.10)pflows out =aΔτT(,σ)

§5.3. COMPONENTS OF STRESS-ENERGY TENSOR

Like all other tensors, the stress-energy tensor is a machine whose definition and significance transcend coordinate systems and reference frames. But any one observer, locked as he is into some one Lorentz frame, pays more attention to the components of T T T\boldsymbol{T}T than to T T T\boldsymbol{T}T itself. To each component he ascribes a specific physical significance. Of greatest interest, perhaps, is the "time-time" component. It is the total density of mass-energy as measured in the observer's Lorentz frame:
(5.11) T 00 = T 0 0 = T 00 = T ( e 0 , e 0 ) = density of mass-energy (5.11) T 00 = T 0 0 = T 00 = T e 0 , e 0 =  density of mass-energy  {:(5.11)T_(00)=-T_(0)^(0)=T^(00)=T(e_(0),e_(0))=" density of mass-energy ":}\begin{equation*} T_{00}=-T_{0}{ }^{0}=T^{00}=\boldsymbol{T}\left(\boldsymbol{e}_{0}, \boldsymbol{e}_{0}\right)=\text { density of mass-energy } \tag{5.11} \end{equation*}(5.11)T00=T00=T00=T(e0,e0)= density of mass-energy 
(cf. equation 5.7, with the observer's 4-velocity u u u\boldsymbol{u}u replaced by the basis vector e 0 = u e 0 = u e_(0)=u\boldsymbol{e}_{0}=\boldsymbol{u}e0=u ).
The "spacetime" components T j 0 T j 0 T^(j0)T^{j 0}Tj0 can be interpreted by considering the interior of a soap box at rest in the observer's frame. If its volume is V V VVV, then its volume 1 -form is Σ = V u = + V d t Σ = V u = + V d t Sigma=-Vu=+Vdt\boldsymbol{\Sigma}=-V \boldsymbol{u}=+V \boldsymbol{d} tΣ=Vu=+Vdt; and the μ μ mu\muμ-component of 4 -momentum inside it is
p μ = d x μ , p = T ( d x μ , Σ ) = V T ( d x μ , d t ) = V T μ 0 . p μ = d x μ , p = T d x μ , Σ = V T d x μ , d t = V T μ 0 . p^(mu)=(:dx^(mu),p:)=T(dx^(mu),Sigma)=VT(dx^(mu),dt)=VT^(mu0).p^{\mu}=\left\langle\boldsymbol{d} x^{\mu}, \boldsymbol{p}\right\rangle=\boldsymbol{T}\left(\boldsymbol{d} x^{\mu}, \boldsymbol{\Sigma}\right)=V \boldsymbol{T}\left(\boldsymbol{d} x^{\mu}, \boldsymbol{d} t\right)=V T^{\mu 0} .pμ=dxμ,p=T(dxμ,Σ)=VT(dxμ,dt)=VTμ0.
Thus, the 4 -momentum per unit volume is
(5.12a) p μ / V = T μ 0 , (5.12a) p μ / V = T μ 0 , {:(5.12a)p^(mu)//V=T^(mu0)",":}\begin{equation*} p^{\mu} / V=T^{\mu 0}, \tag{5.12a} \end{equation*}(5.12a)pμ/V=Tμ0,
or, equivalently:
(5.13a) T 00 = density of mass-energy (units: g / cm 3 , or erg / cm 3 , or cm 2 ) ; (5.13b) T j 0 = density of j -component of momentum ( units: g ( cm / sec ) cm 3 , or cm 2 ) (5.13a) T 00 =  density of mass-energy   (units:  g / cm 3 , or erg  / cm 3 , or  cm 2 ; (5.13b) T j 0 =  density of  j -component of momentum   units:  g ( cm / sec ) cm 3 ,  or  cm 2 {:[(5.13a)T^(00)=" density of mass-energy "],[" (units: "{:g//cm^(3)", or erg "//cm^(3)", or "cm^(-2));],[(5.13b)T^(j0)=" density of "j"-component of momentum "],[(" units: "g((cm)//sec)cm^(-3)," or "cm^(-2))]:}\begin{align*} T^{00}= & \text { density of mass-energy } \tag{5.13a}\\ & \text { (units: } \left.\mathrm{g} / \mathrm{cm}^{3} \text {, or erg } / \mathrm{cm}^{3} \text {, or } \mathrm{cm}^{-2}\right) ; \\ T^{j 0}= & \text { density of } \mathrm{j} \text {-component of momentum } \tag{5.13b}\\ & \left(\text { units: } \mathrm{g}(\mathrm{~cm} / \mathrm{sec}) \mathrm{cm}^{-3}, \text { or } \mathrm{cm}^{-2}\right) \end{align*}(5.13a)T00= density of mass-energy  (units: g/cm3, or erg /cm3, or cm2);(5.13b)Tj0= density of j-component of momentum ( units: g( cm/sec)cm3, or cm2)
The top of a soap box:
Its volume 1-form
Its 4-momentum that flows across
Physical interpretation of stress-energy tensor's components:
T 0 k T 0 k T^(0k)T^{0 k}T0k : energy flux
T j k T j k T^(jk)T^{j k}Tjk : stress
Number-flux vector for swarm of particles defined
During the lapse of time Δ t Δ t Delta t\Delta tΔt, this 2 -surface sweeps out a 3 -volume with volume 1 -form Σ = a Δ t d x k Σ = a Δ t d x k Sigma=a Delta tdx^(k)\boldsymbol{\Sigma}=a \Delta t \boldsymbol{d} x^{k}Σ=aΔtdxk (see Box 5.2). The μ μ mu\muμ-component of 4 -momentum that crosses the 2 -surface in time Δ t Δ t Delta t\Delta tΔt is
p μ = T ( d x μ , Σ ) = a Δ t T ( d x μ , d x k ) = a Δ t T μ k p μ = T d x μ , Σ = a Δ t T d x μ , d x k = a Δ t T μ k p^(mu)=T(dx^(mu),Sigma)=a Delta tT(dx^(mu),dx^(k))=a Delta tT^(mu k)p^{\mu}=\boldsymbol{T}\left(\boldsymbol{d} x^{\mu}, \boldsymbol{\Sigma}\right)=a \Delta t \boldsymbol{T}\left(\boldsymbol{d} x^{\mu}, \boldsymbol{d} x^{k}\right)=a \Delta t T^{\mu k}pμ=T(dxμ,Σ)=aΔtT(dxμ,dxk)=aΔtTμk
Thus, the flux of 4-momentum (4-momentum crossing a unit surface oriented perpendicular to e k e k e_(k)\boldsymbol{e}_{k}ek, in unit time) is
(5.12b) ( p μ / a Δ t ) crossing surface to e k = T μ k (5.12b) p μ / a Δ t crossing surface   to  e k = T μ k {:(5.12b)(p^(mu)//a Delta t)_("crossing surface "_|_" to "e_(k))=T^(mu k):}\begin{equation*} \left(p^{\mu} / a \Delta t\right)_{\text {crossing surface } \perp \text { to } \boldsymbol{e}_{k}}=T^{\mu k} \tag{5.12b} \end{equation*}(5.12b)(pμ/aΔt)crossing surface  to ek=Tμk
or, equivalently:
T 0 k = k -component of energy flux (units: erg / cm 2 sec , or cm 2 ); T j k = j , k component of "stress" k -component of flux of j -component of momentum j -component of force produced by fields and matter at x k ϵ acting on fields and matter at x k + ϵ across a unit surface, the perpendicular to which is e k (units: dynes / cm 2 , or cm 2 ). T 0 k = k -component of energy flux   (units: erg  / cm 2 sec , or  cm 2  );  T j k = j , k  component of "stress"  k -component of flux of  j -component of momentum  j -component of force produced by fields and matter at  x k ϵ  acting   on fields and matter at  x k + ϵ  across a unit surface, the perpendicular   to which is  e k  (units: dynes  / cm 2 ,  or  cm 2  ).  {:[T^(0k)=k"-component of energy flux "],[quad" (units: erg "//cm^(2)sec", or "cm^(-2)" ); "],[T^(jk)=j","k" component of "stress" "],[-=k"-component of flux of "j"-component of momentum "],[-=j"-component of force produced by fields and matter at "x^(k)-epsilon" acting "],[" on fields and matter at "x^(k)+epsilon" across a unit surface, the perpendicular "],[" to which is "e_(k)],[quad" (units: dynes "//cm^(2)","" or "cm^(-2)" ). "]:}\begin{aligned} T^{0 k}= & k \text {-component of energy flux } \\ & \quad \text { (units: erg } / \mathrm{cm}^{2} \mathrm{sec} \text {, or } \mathrm{cm}^{-2} \text { ); } \\ T^{j k}= & j, k \text { component of "stress" } \\ \equiv & k \text {-component of flux of } j \text {-component of momentum } \\ \equiv & j \text {-component of force produced by fields and matter at } x^{k}-\epsilon \text { acting } \\ & \text { on fields and matter at } x^{k}+\epsilon \text { across a unit surface, the perpendicular } \\ & \text { to which is } \boldsymbol{e}_{k} \\ & \quad \text { (units: dynes } / \mathrm{cm}^{2}, \text { or } \mathrm{cm}^{-2} \text { ). } \end{aligned}T0k=k-component of energy flux  (units: erg /cm2sec, or cm2 ); Tjk=j,k component of "stress" k-component of flux of j-component of momentum j-component of force produced by fields and matter at xkϵ acting  on fields and matter at xk+ϵ across a unit surface, the perpendicular  to which is ek (units: dynes /cm2, or cm2 ). 
(Recall that "momentum transfer per second" is the same as "force.")
The stress-energy tensor is necessarily symmetric, T α β = T β α T α β = T β α T^(alpha beta)=T^(beta alpha)T^{\alpha \beta}=T^{\beta \alpha}Tαβ=Tβα; but the proof of this will be delayed until several illustrations have been examined.

§5.4. STRESS-ENERGY TENSOR FOR A SWARM OF PARTICLES

Consider a swarm of particles. Choose some event P P P\mathscr{P}P inside the swarm. Divide the particles near P P P\mathscr{P}P into categories, A = 1 , 2 , A = 1 , 2 , A=1,2,dotsA=1,2, \ldotsA=1,2,, in such a way that all particles in the same category have the same properties:
m ( A ) m ( A ) m_((A))m_{(A)}m(A), rest mass;
u ( A ) u ( A ) u_((A))\boldsymbol{u}_{(A)}u(A), 4-velocity;
p ( A ) = m ( A ) u ( A ) p ( A ) = m ( A ) u ( A ) p_((A))=m_((A))u_((A))\boldsymbol{p}_{(A)}=m_{(A)} \boldsymbol{u}_{(A)}p(A)=m(A)u(A), 4-momentum.
m_((A)), rest mass; u_((A)), 4-velocity; p_((A))=m_((A))u_((A)), 4-momentum.| $m_{(A)}$, | rest mass; | | :--- | :--- | | $\boldsymbol{u}_{(A)}$, | 4-velocity; | | $\boldsymbol{p}_{(A)}=m_{(A)} \boldsymbol{u}_{(A)}$, | 4-momentum. |
Let N ( A ) N ( A ) N_((A))N_{(A)}N(A) be the number of category- A A AAA particles per unit volume, as measured in the particles' own rest frame. Then the "number-flux vector" S ( A ) S ( A ) S_((A))\boldsymbol{S}_{(A)}S(A), defined by
(5.14) s ( A ) N ( A ) u ( A ) (5.14) s ( A ) N ( A ) u ( A ) {:(5.14)s_((A))-=N_((A))u_((A)):}\begin{equation*} \boldsymbol{s}_{(A)} \equiv N_{(A)} \boldsymbol{u}_{(A)} \tag{5.14} \end{equation*}(5.14)s(A)N(A)u(A)
has components with simple physical meanings. In a frame where category- A A AAA particles have ordinary velocity v ( A ) v ( A ) v_((A))v_{(A)}v(A), the meanings are:
(5.15b) S ( A ) = N ( A ) u ( A ) = S ( A ) 0 v ( A ) = flux of particles. (5.15b) S ( A ) = N ( A ) u ( A ) = S ( A ) 0 v ( A ) =  flux of particles.  {:(5.15b)S_((A))=N_((A))u_((A))=S_((A))^(0)v_((A))=" flux of particles. ":}\begin{equation*} \boldsymbol{S}_{(A)}=N_{(A)} \boldsymbol{u}_{(A)}=S_{(A)}^{0} \boldsymbol{v}_{(A)}=\text { flux of particles. } \tag{5.15b} \end{equation*}(5.15b)S(A)=N(A)u(A)=S(A)0v(A)= flux of particles. 
Consequently, the 4-momentum density has components
T ( A ) μ 0 = p ( A ) μ S ( A ) 0 = m ( A ) u ( A ) μ N ( A ) u ( A ) 0 = m ( A ) N ( A ) u ( A ) μ u ( A ) 0 T ( A ) μ 0 = p ( A ) μ S ( A ) 0 = m ( A ) u ( A ) μ N ( A ) u ( A ) 0 = m ( A ) N ( A ) u ( A ) μ u ( A ) 0 {:[T_((A))^(mu0)=p_((A))^(mu)S_((A))^(0)=m_((A))u_((A))^(mu)N_((A))u_((A))^(0)],[=m_((A))N_((A))u_((A))^(mu)u_((A))^(0)]:}\begin{aligned} T_{(A)}^{\mu 0} & =p_{(A)}^{\mu} S_{(A)}^{0}=m_{(A)} u_{(A)}^{\mu} N_{(A)} u_{(A)}^{0} \\ & =m_{(A)} N_{(A)} u_{(A)}^{\mu} u_{(A)}^{0} \end{aligned}T(A)μ0=p(A)μS(A)0=m(A)u(A)μN(A)u(A)0=m(A)N(A)u(A)μu(A)0
and the flux of μ μ mu\muμ-component of momentum across a surface with perpendicular direction e j e j e_(j)\boldsymbol{e}_{j}ej is
T ( A ) μ j = p ( A ) μ S ( A ) j = m ( A ) u ( A ) μ N ( A ) u ( A ) j = m ( A ) N ( A ) u ( A ) μ u ( A ) j . T ( A ) μ j = p ( A ) μ S ( A ) j = m ( A ) u ( A ) μ N ( A ) u ( A ) j = m ( A ) N ( A ) u ( A ) μ u ( A ) j . {:[T_((A))^(mu j)=p_((A))^(mu)S_((A))^(j)=m_((A))u_((A))^(mu)N_((A))u_((A))^(j)],[=m_((A))N_((A))u_((A))^(mu)u_((A))^(j).]:}\begin{aligned} T_{(A)}^{\mu j} & =p_{(A)}^{\mu} S_{(A)}^{j}=m_{(A)} u_{(A)}^{\mu} N_{(A)} u_{(A)}^{j} \\ & =m_{(A)} N_{(A)} u_{(A)}^{\mu} u_{(A)}^{j} . \end{aligned}T(A)μj=p(A)μS(A)j=m(A)u(A)μN(A)u(A)j=m(A)N(A)u(A)μu(A)j.
These equations are precisely the μ , 0 μ , 0 mu,0\mu, 0μ,0 and μ , j μ , j mu,j\mu, jμ,j components of the geometric, frameindependent equation
(5.16) T ( A ) = m ( A ) N ( A ) u ( A ) u ( A ) = p ( A ) S ( A ) (5.16) T ( A ) = m ( A ) N ( A ) u ( A ) u ( A ) = p ( A ) S ( A ) {:(5.16)T_((A))=m_((A))N_((A))u_((A))oxu_((A))=p_((A))oxS_((A)):}\begin{equation*} \boldsymbol{T}_{(A)}=m_{(A)} N_{(A)} \boldsymbol{u}_{(A)} \otimes \boldsymbol{u}_{(A)}=\boldsymbol{p}_{(A)} \otimes \boldsymbol{S}_{(A)} \tag{5.16} \end{equation*}(5.16)T(A)=m(A)N(A)u(A)u(A)=p(A)S(A)
The total number-flux vector and stress-energy tensor for all particles in the swarm near P P P\mathscr{P}P are obtained by summing over all categories:
(5.17) S = A N ( A ) u ( A ) (5.18) T = A m ( A ) N ( A ) u ( A ) u ( A ) = A p ( A ) S ( A ) (5.17) S = A N ( A ) u ( A ) (5.18) T = A m ( A ) N ( A ) u ( A ) u ( A ) = A p ( A ) S ( A ) {:[(5.17)S=sum_(A)N_((A))u_((A))],[(5.18)T=sum_(A)m_((A))N_((A))u_((A))oxu_((A))=sum_(A)p_((A))oxS_((A))]:}\begin{align*} \boldsymbol{S} & =\sum_{A} N_{(A)} \boldsymbol{u}_{(A)} \tag{5.17}\\ \boldsymbol{T} & =\sum_{A} m_{(A)} N_{(A)} \boldsymbol{u}_{(A)} \otimes \boldsymbol{u}_{(A)}=\sum_{A} \boldsymbol{p}_{(A)} \otimes \boldsymbol{S}_{(A)} \tag{5.18} \end{align*}(5.17)S=AN(A)u(A)(5.18)T=Am(A)N(A)u(A)u(A)=Ap(A)S(A)

§5.5. STRESS-ENERGY TENSOR FOR A PERFECT FLUID

There is no simpler example of a fluid than a gas of noninteracting particles ("ideal gas") in which the velocities of the particles are distributed isotropically. In the Lorentz frame where isotropy obtains, symmetry argues equality of the diagonal space-space components of the stress-energy tensor,
(5.19) T x x = T y y = T z z = A m ( A ) v x ( A ) ( 1 v ( A ) 2 ) 1 / 2 N ( A ) v x ( A ) ( 1 v ( A ) ) 2 / 2 (5.19) T x x = T y y = T z z = A m ( A ) v x ( A ) 1 v ( A ) 2 1 / 2 N ( A ) v x ( A ) 1 v ( A ) 2 / 2 {:(5.19)T_(xx)=T_(yy)=T_(zz)=sum_(A)(m_((A))v_(x(A)))/((1-v_((A))^(2))^(1//2))(N_((A))v_(x(A)))/((1-v_((A)))^(2//2)):}\begin{equation*} T_{x x}=T_{y y}=T_{z z}=\sum_{A} \frac{m_{(A)} v_{x(A)}}{\left(1-v_{(A)}^{2}\right)^{1 / 2}} \frac{N_{(A)} v_{x(A)}}{\left(1-v_{(A)}\right)^{2 / 2}} \tag{5.19} \end{equation*}(5.19)Txx=Tyy=Tzz=Am(A)vx(A)(1v(A)2)1/2N(A)vx(A)(1v(A))2/2
and vanishing of all the off-diagonal components. Moreover, (5.19) represents a product: the number of particles per unit volume, multiplied by velocity in the x x xxx-direction (giving flux in the x x xxx-direction) and by momentum in the x x xxx-direction,
Stress-energy tensor for swarm of particles
Ideal gas defined
giving the standard kinetic-theory expression for the pressure, p p ppp. Therefore, the stress-energy tensor takes the form
(5.20) T α β = ρ 0 0 0 0 p 0 0 0 0 p 0 0 0 0 p (5.20) T α β = ρ 0 0 0 0 p 0 0 0 0 p 0 0 0 0 p {:(5.20)T_(alpha beta)=||[rho,0,0,0],[0,p,0,0],[0,0,p,0],[0,0,0,p]||:}T_{\alpha \beta}=\left\|\begin{array}{llll} \rho & 0 & 0 & 0 \tag{5.20}\\ 0 & p & 0 & 0 \\ 0 & 0 & p & 0 \\ 0 & 0 & 0 & p \end{array}\right\|(5.20)Tαβ=ρ0000p0000p0000p
in this special Lorentz frame-the "rest frame" of the gas. Here the quantity ρ ρ rho\rhoρ has nothing directly to do with the rest-masses of the constituent particles. It measures the density of rest-plus-kinetic energy of these particles.
Rewrite (5.20) in terms of the 4-velocity u α = ( 1 , 0 , 0 , 0 ) u α = ( 1 , 0 , 0 , 0 ) u^(alpha)=(1,0,0,0)u^{\alpha}=(1,0,0,0)uα=(1,0,0,0) of the fluid in the gas's rest frame, and find
T α β = ρ 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 + 0 0 0 0 0 p 0 0 0 0 p 0 0 0 0 p = ρ u α u β + p ( η α β + u α u β ) , T α β = ρ 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 + 0 0 0 0 0 p 0 0 0 0 p 0 0 0 0 p = ρ u α u β + p η α β + u α u β , {:[T_(alpha beta)=||[rho,0,0,0],[0,0,0,0],[0,0,0,0],[0,0,0,0]||+||[0,0,0,0],[0,p,0,0],[0,0,p,0],[0,0,0,p]||],[=rhou_(alpha)u_(beta)+p(eta_(alpha beta)+u_(alpha)u_(beta))","]:}\begin{aligned} T_{\alpha \beta} & =\left\|\begin{array}{llll} \rho & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{array}\right\|+\left\|\begin{array}{llll} 0 & 0 & 0 & 0 \\ 0 & p & 0 & 0 \\ 0 & 0 & p & 0 \\ 0 & 0 & 0 & p \end{array}\right\| \\ & =\rho u_{\alpha} u_{\beta}+p\left(\eta_{\alpha \beta}+u_{\alpha} u_{\beta}\right), \end{aligned}Tαβ=ρ000000000000000+00000p0000p0000p=ρuαuβ+p(ηαβ+uαuβ),
Stress-energy tensor for ideal gas or perfect fluid
or, in frame-independent, geometric language
(5.21) T = p g + ( ρ + p ) u u (5.21) T = p g + ( ρ + p ) u u {:(5.21)T=pg+(rho+p)u ox u:}\begin{equation*} T=p \boldsymbol{g}+(\rho+p) \boldsymbol{u} \otimes \boldsymbol{u} \tag{5.21} \end{equation*}(5.21)T=pg+(ρ+p)uu
Expression (5.21) has general application. It is exact for the "ideal gas" just considered. It is also exact for any fluid that is "perfect" in the sense that it is free of such transport processes as heat conduction and viscosity, and therefore (in the rest frame) free of shear stress (diagonal stress tensor; diagonal components identical, because if they were not identical, a rotation of the frame of reference would reveal presence of shear stress). However, for a general perfect fluid, density ρ ρ rho\rhoρ of mass-energy as measured in the fluid's rest frame includes not only rest mass plus kinetic energy of particles, but also energy of compression, energy of nuclear binding, and all other sources of mass-energy [total density of mass-energy as it might be determined by an idealized experiment, such as that depicted in Figure 1.12, with the sample mass at the center of the sphere, and the test particle executing oscillations of small amplitude about that location, with ω 2 = ( 4 π / 3 ) ρ ] ω 2 = ( 4 π / 3 ) ρ {:omega^(2)=(4pi//3)rho]\left.\omega^{2}=(4 \pi / 3) \rho\right]ω2=(4π/3)ρ].

§5.6. ELECTROMAGNETIC STRESS-ENERGY

Faraday, with his picture of tensions along lines of force and pressures at right angles to them (Figure 5.2), won insight into new features of electromagnetism. In addition to the tension E 2 / 8 π E 2 / 8 π E^(2)//8pi\boldsymbol{E}^{2} / 8 \piE2/8π (or B 2 / 8 π B 2 / 8 π B^(2)//8pi\boldsymbol{B}^{2} / 8 \piB2/8π ) along lines of force, and an equal pressure at right angles, one has the Poynting flux ( E × B ) / 4 π ( E × B ) / 4 π (E xx B)//4pi(\boldsymbol{E} \times \boldsymbol{B}) / 4 \pi(E×B)/4π and the Maxwell expression for the
Figure 5.2.
Faraday stresses at work. When the electromagnet is connected to an alternating current, the aluminum ring flies into the air.
energy density, ( E 2 + B 2 ) / 8 π E 2 + B 2 / 8 π (E^(2)+B^(2))//8pi\left(\boldsymbol{E}^{2}+\boldsymbol{B}^{2}\right) / 8 \pi(E2+B2)/8π. All these quantities find their places in the Maxwell stress-energy tensor, defined by
(5.22) 4 π T μ ν = F μ α F α ν 1 4 η μ ν F α β F α β . (5.22) 4 π T μ ν = F μ α F α ν 1 4 η μ ν F α β F α β . {:(5.22)4piT^(mu nu)=F^(mu alpha)F_(alpha)^(nu)-(1)/(4)eta^(mu nu)F_(alpha beta)F^(alpha beta).:}\begin{equation*} 4 \pi T^{\mu \nu}=F^{\mu \alpha} F_{\alpha}^{\nu}-\frac{1}{4} \eta^{\mu \nu} F_{\alpha \beta} F^{\alpha \beta} . \tag{5.22} \end{equation*}(5.22)4πTμν=FμαFαν14ημνFαβFαβ.
Stress-energy tensor for electromagnetic field

Exercise 5.1.

EXERCISE

Show that expression (5.22), evaluated in a Lorentz coordinate frame, gives
T 00 = ( E 2 + B 2 ) / 8 π , T 0 j = T j 0 = ( E × B ) j / 4 π (5.23) T j k = 1 4 π [ ( E j E k + B j B k ) + 1 2 ( E 2 + B 2 ) δ j k ] T 00 = E 2 + B 2 / 8 π , T 0 j = T j 0 = ( E × B ) j / 4 π (5.23) T j k = 1 4 π E j E k + B j B k + 1 2 E 2 + B 2 δ j k {:[T^(00)=(E^(2)+B^(2))//8pi","quadT^(0j)=T^(j0)=(E xx B)^(j)//4pi],[(5.23)T^(jk)=(1)/(4pi)[-(E^(j)E^(k)+B^(j)B^(k))+(1)/(2)(E^(2)+B^(2))delta^(jk)]]:}\begin{align*} T^{00} & =\left(\boldsymbol{E}^{2}+\boldsymbol{B}^{2}\right) / 8 \pi, \quad T^{0 j}=T^{j 0}=(\boldsymbol{E} \times \boldsymbol{B})^{j} / 4 \pi \\ T^{j k} & =\frac{1}{4 \pi}\left[-\left(E^{j} E^{k}+B^{j} B^{k}\right)+\frac{1}{2}\left(\boldsymbol{E}^{2}+\boldsymbol{B}^{2}\right) \delta^{j k}\right] \tag{5.23} \end{align*}T00=(E2+B2)/8π,T0j=Tj0=(E×B)j/4π(5.23)Tjk=14π[(EjEk+BjBk)+12(E2+B2)δjk]
Show that the stress tensor does describe a tension ( E 2 + B 2 ) / 8 π E 2 + B 2 / 8 π (E^(2)+B^(2))//8pi\left(\boldsymbol{E}^{2}+\boldsymbol{B}^{2}\right) / 8 \pi(E2+B2)/8π along the field lines and a pressure ( E 2 + B 2 ) / 8 π E 2 + B 2 / 8 π (E^(2)+B^(2))//8pi\left(\boldsymbol{E}^{2}+\boldsymbol{B}^{2}\right) / 8 \pi(E2+B2)/8π perpendicular to the field lines, as stated in the text.

§5.7. SYMMETRY OF THE STRESS-ENERGY TENSOR

All the stress-energy tensors explored above were symmetric. That they could not have been otherwise one sees as follows.
Calculate in a specific Lorentz frame. Consider first the momentum density (components T j 0 T j 0 T^(j0)T^{j 0}Tj0 ) and the energy flux (components T 0 j T 0 j T^(0j)T^{0 j}T0j ). They must be equal because
Proof that stress-energy tensor is symmetric energy = = === mass ( " E = M c 2 = M " E = M c 2 = M "E=Mc^(2)=M" E=M c^{2}=M"E=Mc2=M "):
T 0 j = ( energy flux ) = ( energy density ) × ( mean velocity of energy flow ) j (5.24) = ( mass density ) × ( mean velocity of mass flow ) j = ( momentum density ) = T j 0 . T 0 j = (  energy flux  ) = (  energy density  ) × (  mean velocity of energy flow  ) j (5.24) = (  mass density  ) × (  mean velocity of mass flow  ) j = (  momentum density  ) = T j 0 . {:[T^(0j)=(" energy flux ")],[=(" energy density ")xx(" mean velocity of energy flow ")^(j)],[(5.24)=(" mass density ")xx(" mean velocity of mass flow ")^(j)],[=(" momentum density ")=T^(j0).]:}\begin{align*} T^{0 j} & =(\text { energy flux }) \\ & =(\text { energy density }) \times(\text { mean velocity of energy flow })^{j} \\ & =(\text { mass density }) \times(\text { mean velocity of mass flow })^{j} \tag{5.24}\\ & =(\text { momentum density })=T^{j 0} . \end{align*}T0j=( energy flux )=( energy density )×( mean velocity of energy flow )j(5.24)=( mass density )×( mean velocity of mass flow )j=( momentum density )=Tj0.
Only the stress tensor T j k T j k T^(jk)T^{j k}Tjk remains. For it, one uses the same standard argument as in Newtonian theory. Consider a very small cube, of side L L LLL, mass-energy T 00 L 3 T 00 L 3 T^(00)L^(3)T^{00} L^{3}T00L3,
and moment of inertia T 00 L 5 T 00 L 5 ∼T^(00)L^(5)\sim T^{00} L^{5}T00L5. With the space coordinates centered at the cube, the expression for the z z zzz-component of torque exerted on the cube by its surroundings is
τ z = ( T y x L 2 ) ( -component of force on + x face ) ( L / 2 ) lever arm to + x face ) + ( T y x L 2 ) ( y -component of force on x face ) ( L / 2 ) lever arm to x face ) ( T x y L 2 ) ( x -component of force on + y face ) ( L / 2 ) lever arm to + y face ) ( T x y L 2 ) ( x -component of force on y face ) ( L / 2 ) ( lever arm to y face ) = ( T x y T y x ) L 3 . τ z = T y x L 2 -component   of force on  + x  face  ( L / 2 )  lever   arm to  + x  face  + T y x L 2 y -component   of force on  x  face  ( L / 2 )  lever   arm to  x  face  T x y L 2 x -component   of force on  + y  face  ( L / 2 )  lever   arm to  + y  face  T x y L 2 x -component   of force on  y  face  ( L / 2 )  lever   arm to  y  face  = T x y T y x L 3 {:[tau^(z)=ubrace((-T^(yx)L^(2))ubrace)_((["-component "],[" of force on "],[+x" face "]))ubrace((L//2)ubrace)_({:[" lever "],[" arm to "],[+x" face "]))+ubrace((T^(yx)L^(2))ubrace)_(([y"-component "],[" of force on "],[-x" face "]))ubrace((-L//2)ubrace)_({:[" lever "],[" arm to "],[-x" face "]))],[-ubrace((-T^(xy)L^(2))ubrace)_(([x"-component "],[" of force on "],[+y" face "]))ubrace((L//2)ubrace)_({:[" lever "],[" arm to "],[+y" face "]))-ubrace((T^(xy)L^(2))ubrace)_(([x"-component "],[" of force on "],[-y" face "]))ubrace((-L//2)ubrace)_(([" lever "],[" arm to "],[-y" face "]))],[=(T^(xy)-T^(yx))L^(3)". "]:}\begin{aligned} & \tau^{z}=\underbrace{\left(-T^{y x} L^{2}\right)}_{\left(\begin{array}{c} \text {-component } \\ \text { of force on } \\ +x \text { face } \end{array}\right)} \underbrace{(L / 2)}_{\left.\begin{array}{c} \text { lever } \\ \text { arm to } \\ +x \text { face } \end{array}\right)}+\underbrace{\left(T^{y x} L^{2}\right)}_{\left(\begin{array}{c} y \text {-component } \\ \text { of force on } \\ -x \text { face } \end{array}\right)} \underbrace{(-L / 2)}_{\left.\begin{array}{c} \text { lever } \\ \text { arm to } \\ -x \text { face } \end{array}\right)} \\ & -\underbrace{\left(-T^{x y} L^{2}\right)}_{\left(\begin{array}{c} x \text {-component } \\ \text { of force on } \\ +y \text { face } \end{array}\right)} \underbrace{(L / 2)}_{\left.\begin{array}{c} \text { lever } \\ \text { arm to } \\ +y \text { face } \end{array}\right)}-\underbrace{\left(T^{x y} L^{2}\right)}_{\left(\begin{array}{c} x \text {-component } \\ \text { of force on } \\ -y \text { face } \end{array}\right)} \underbrace{(-L / 2)}_{\left(\begin{array}{c} \text { lever } \\ \text { arm to } \\ -y \text { face } \end{array}\right)} \\ & =\left(T^{x y}-T^{y x}\right) L^{3} \text {. } \end{aligned}τz=(TyxL2)(-component  of force on +x face )(L/2) lever  arm to +x face )+(TyxL2)(y-component  of force on x face )(L/2) lever  arm to x face )(TxyL2)(x-component  of force on +y face )(L/2) lever  arm to +y face )(TxyL2)(x-component  of force on y face )(L/2)( lever  arm to y face )=(TxyTyx)L3
Since the torque decreases only as L 3 L 3 L^(3)L^{3}L3 with decreasing L L LLL, while the moment of inertia decreases as L 5 L 5 L^(5)L^{5}L5, the torque will set an arbitrarily small cube into arbitrarily great angular acceleration-which is absurd. To avoid this, the stresses distribute themselves so the torque vanishes:
T y x = T x y . T y x = T x y . T^(yx)=T^(xy).T^{y x}=T^{x y} .Tyx=Txy.
Put differently, if the stresses were not so distributed, the resultant infinite angular accelerations would instantaneously redistribute them back to equilibrium. This condition of torque balance, repeated for all other pairs of directions, is equivalent to symmetry of the stresses:
(5.25) T j k = T k j . (5.25) T j k = T k j . {:(5.25)T^(jk)=T^(kj).:}\begin{equation*} T^{j k}=T^{k j} . \tag{5.25} \end{equation*}(5.25)Tjk=Tkj.

§5.8. CONSERVATION OF 4-MOMENTUM: INTEGRAL FORMULATION

Energy-momentum conservation has been a cornerstone of physics for more than a century. Nowhere does its essence shine forth so clearly as in Einstein's geometric formulation of it (Figure 5.3,a). There one examines a four-dimensional region of spacetime V V V\mathscr{V}V bounded by a closed, three-dimensional surface V V delV\partial \mathscr{V}V. As particles and fields flow into V V V\mathscr{V}V and later out, they carry 4-momentum. Inside V V V\mathscr{V}V the particles collide, break up, radiate; radiation propagates, jiggles particles, produces pairs. But at each stage in this complex maze of physical processes, total energy-momentum is conserved. The energy-momentum lost by particles goes into fields; the energymomentum lost by fields goes into particles. So finally, when the "river" of 4-momentum exits from V V V\mathscr{V}V, it carries out precisely the same energy-momentum as it carried in.
Restate this equality by asking for the total flux of 4-momentum outward across V V delV\partial \mathscr{V}V. Count inflowing 4 -momentum negatively. Then "inflow equals outflow" means "total outflow vanishes":
Integral conservation law for 4-momentum:
r T d 3 Σ = 0 r T d 3 Σ = 0 oint_(del r*)T*d^(3)Sigma=0\oint_{\partial r \cdot} \boldsymbol{T} \cdot d^{3} \boldsymbol{\Sigma}=0rTd3Σ=0

(a)

V = S 2 S 1 V = S 2 S 1 del*V=S_(2)-S_(1)\partial \cdot V=S_{2}-S_{1}V=S2S1
(b)

(d)
V = ( V 2 V 1 ) = S ¯ S V = V 2 V 1 = S ¯ S delV=del(V_(2)-V_(1))= bar(S)-S\partial \mathscr{V}=\partial\left(\mathscr{V}_{2}-\mathscr{V}_{1}\right)=\bar{S}-SV=(V2V1)=S¯S
(c)
V = S ¯ S + T V = S ¯ S + T delV= bar(S)-S+T\partial \mathscr{V}=\bar{S}-S+\mathscr{T}V=S¯S+T
(e)
Figure 5.3.
(a) A four-dimensional region of spacetime V V V\mathscr{V}V bounded by a closed three-dimensional surface V V delV\partial \mathscr{V}V. The positive sense of V V delV\partial \mathscr{V}V is defined to be everywhere outward (away from V V V\mathscr{V}V ). Conservation of energymomentum demands that every bit of 4 -momentum which flows into V V V\mathscr{V}V through V V delV\partial \mathscr{V}V must somewhere flow back out; none can get lost inside; the interior contains no "sinks." Equivalently, the total flux of 4-momentum across V V delV\partial \mathscr{V}V in the positive (outward) sense must be zero:
γ T μ α d 3 Σ α = 0 . γ T μ α d 3 Σ α = 0 . oint_(del gamma)T^(mu alpha)d^(3)Sigma_(alpha)=0.\oint_{\partial \gamma} T^{\mu \alpha} d^{3} \Sigma_{\alpha}=0 .γTμαd3Σα=0.
Figures (b), (c), (d), and (e) depict examples to which the text applies this law of conservation of 4 -momentum. All symbols V V V\mathcal{V}V (or S S SSS ) in these figures mean spacetime volumes (or spacelike 3 -volumes) with standard orientations. The dotted arrows indicate the positive sense of the closed surface V V delV\partial \mathscr{V}V used in the text's discussion of 4-momentum conservation. How V V delV\partial \mathscr{V}V is constructed from the surfaces S S SSS and T T T\mathscr{T}T is indicated by formulas below the figures. For example, in case (b), V = S 2 S 1 V = S 2 S 1 delV=S_(2)-S_(1)\partial \mathscr{V}=S_{2}-S_{1}V=S2S1 means that V V delV\partial \mathscr{V}V is made by joining together S 2 S 2 S_(2)S_{2}S2 with its standard orientation and S 1 S 1 S_(1)S_{1}S1 with reversed orientation.
Total flux of 4-momentum outward across a closed three-dimensional surface must vanish.
To calculate the total outward flux in the most elementary of fashions, approximate the closed 3 -surface V V delV\partial \mathscr{V}V by a large number of flat 3 -volumes ("boiler plates") with positive direction oriented outward (away from V V V\mathscr{V}V ). Then
(5.27) p total out = boiler plates A T ( , Σ ( A ) ) = 0 , (5.27) p total out  = boiler plates  A T , Σ ( A ) = 0 , {:(5.27)p_("total out ")=sum_("boiler plates "A)T(dots,Sigma_((A)))=0",":}\begin{equation*} \boldsymbol{p}_{\text {total out }}=\sum_{\text {boiler plates } A} \boldsymbol{T}\left(\ldots, \boldsymbol{\Sigma}_{(A)}\right)=0, \tag{5.27} \end{equation*}(5.27)ptotal out =boiler plates AT(,Σ(A))=0,
where Σ ( 1 ) Σ ( 1 ) Sigma_((1))\boldsymbol{\Sigma}_{(1)}Σ(1) is the volume 1 -form of boiler plate A A AAA. Equivalently, in component notation
(5.27') p total out μ = A T μ α Σ ( A ) α (5.27') p total out  μ = A T μ α Σ ( A ) α {:(5.27')p_("total out ")^(mu)=sum_(A)T^(mu alpha)Sigma_((A)alpha):}\begin{equation*} p_{\text {total out }}^{\mu}=\sum_{A} T^{\mu \alpha} \Sigma_{(A) \alpha} \tag{5.27'} \end{equation*}(5.27')ptotal out μ=ATμαΣ(A)α
To be slightly more sophisticated about the calculation, take the limit as the number of boiler plates goes to infinity and their sizes go to zero. The result is an integral (Box 5.3, at the end of this section),
(5.28) p totalout μ = V T μ α d 3 Σ α = 0 (5.28) p totalout  μ = V T μ α d 3 Σ α = 0 {:(5.28)p_("totalout ")^(mu)=oint_(del V)T^(mu alpha)d^(3)Sigma_(alpha)=0:}\begin{equation*} p_{\text {totalout }}^{\mu}=\oint_{\partial V} T^{\mu \alpha} d^{3} \Sigma_{\alpha}=0 \tag{5.28} \end{equation*}(5.28)ptotalout μ=VTμαd3Σα=0
Think of this (like all component equations) as a convenient way to express a coordinate-independent statement:
(5.29) p total out = V r T d 3 Σ = 0 (5.29) p total out  = V r T d 3 Σ = 0 {:(5.29)p_("total out ")=oint_(del Vr)T*d^(3)Sigma=0:}\begin{equation*} \boldsymbol{p}_{\text {total out }}=\oint_{\partial V r} \boldsymbol{T} \cdot d^{3} \boldsymbol{\Sigma}=0 \tag{5.29} \end{equation*}(5.29)ptotal out =VrTd3Σ=0
To be more sophisticated yet (not recommended on first reading of this book) and to simplify the computations in practical cases, interpret the integrands as exterior differential forms (Box 5.4, at the end of this section).
But however one calculates it, and however one interprets the integrands, the statement of the result is simple: the total flux of 4 -momentum outward across a closed 3 -surface must vanish.
Several special cases of this "integral conservation law," shown in Figure 5.3, are instructive. There shown, in addition to the general case (a), are:

Case (b)

The closed 3-surface V V delV\partial \mathscr{V}V is made up of two slices taken at constant time t t ttt of a specific Lorentz frame, plus timelike surfaces at "infinity" that join the two slices together. The surfaces at infinity do not contribute to V T μ α d 3 Σ α V T μ α d 3 Σ α oint_(del V)T^(mu alpha)d^(3)Sigma_(alpha)\oint_{\partial V} T^{\mu \alpha} d^{3} \Sigma_{\alpha}VTμαd3Σα if the stress-energy tensor dies out rapidly enough there. The boundary V V delV\partial \mathscr{V}V of the standard-oriented 4 -volume V V V\mathscr{V}V, by definition, has its positive sense away from V V V\mathscr{V}V. This demands nonstandard
orientation of S 1 S 1 S_(1)S_{1}S1 (positive sense toward past), as is indicated by writing V = V = delV=\partial \mathscr{V}=V= S 2 S 1 S 2 S 1 S_(2)-S_(1)S_{2}-S_{1}S2S1; and it produces a sign flip in the evaluation of the hypersurface integral
0 = V T α μ d 3 Σ μ = S 1 T α 0 d x d y d z + S 2 T α 0 d x d y d z 0 = V T α μ d 3 Σ μ = S 1 T α 0 d x d y d z + S 2 T α 0 d x d y d z 0=oint_(del V)T^(alpha mu)d^(3)Sigma_(mu)=-int_(S_(1))T^(alpha0)dxdydz+int_(S_(2))T^(alpha0)dxdydz0=\oint_{\partial V} T^{\alpha \mu} d^{3} \Sigma_{\mu}=-\int_{S_{1}} T^{\alpha 0} d x d y d z+\int_{S_{2}} T^{\alpha 0} d x d y d z0=VTαμd3Σμ=S1Tα0dxdydz+S2Tα0dxdydz
Because T α 0 T α 0 T^(alpha0)T^{\alpha 0}Tα0 is the density of 4-momentum, this equation says
(5.30) ( total 4-momentum in all of space at time t 1 ) = S 1 T α 0 d x d y d z = ( total 4-momentum in all of space at time t 2 ) = S 2 T α 0 d x d y d z (5.30) (  total 4-momentum in   all of space at time  t 1 ) = S 1 T α 0 d x d y d z = (  total 4-momentum in   all of space at time  t 2 ) = S 2 T α 0 d x d y d z {:[(5.30)((" total 4-momentum in ")/(" all of space at time "t_(1)))=int_(S_(1))T^(alpha0)dxdydz],[=((" total 4-momentum in ")/(" all of space at time "t_(2)))=int_(S_(2))T^(alpha0)dxdydz]:}\begin{align*} \binom{\text { total 4-momentum in }}{\text { all of space at time } t_{1}} & =\int_{S_{1}} T^{\alpha 0} d x d y d z \tag{5.30}\\ & =\binom{\text { total 4-momentum in }}{\text { all of space at time } t_{2}}=\int_{S_{2}} T^{\alpha 0} d x d y d z \end{align*}(5.30)( total 4-momentum in  all of space at time t1)=S1Tα0dxdydz=( total 4-momentum in  all of space at time t2)=S2Tα0dxdydz

Case (c)

Here one wants to compare hypersurface integrals over S S S\mathcal{S}S and S S ¯ bar(S)\overline{\mathcal{S}}S, which are slices of constant time, t = t = t=t=t= const and t ¯ = t ¯ = bar(t)=\bar{t}=t¯= const in two different Lorentz frames. To form a closed surface, one adds time-like hypersurfaces at infinity and assumes they do not contribute to the integral. The orientations fit together smoothly and give a closed surface
V = S ¯ S + ( surfaces at infinity ) V = S ¯ S + (  surfaces at infinity  ) delV= bar(S)-S+(" surfaces at infinity ")\partial \mathscr{V}=\bar{S}-S+(\text { surfaces at infinity })V=S¯S+( surfaces at infinity )
only if one takes V = V 2 V 1 V = V 2 V 1 V=V_(2)-V_(1)\mathscr{V}=\mathscr{V}_{2}-\mathscr{V}_{1}V=V2V1-i.e., only if one uses the nonstandard 4-volume orientation in V 1 V 1 V_(1)\mathscr{V}_{1}V1. (See part A. 1 of Box 5.3 for "standard" versus "non-standard" orientation.) The integral conservation law then gives
0 = S ¯ T d 3 Σ S T d 3 Σ 0 = S ¯ T d 3 Σ S T d 3 Σ 0=int_( bar(S))T*d^(3)Sigma-int_(S)T*d^(3)Sigma0=\int_{\bar{S}} \boldsymbol{T} \cdot d^{3} \boldsymbol{\Sigma}-\int_{S} \boldsymbol{T} \cdot d^{3} \boldsymbol{\Sigma}0=S¯Td3ΣSTd3Σ
or, equivalently,
(5.31) S ¯ T d 3 Σ = ( total 4 -momentum p on S ¯ ) = S T d 3 Σ = ( total 4-momentum p on S ) (5.31) S ¯ T d 3 Σ = (  total  4 -momentum  p  on  S ¯ ) = S T d 3 Σ = (  total 4-momentum  p  on  S ) {:[(5.31)int_( bar(S))T*d^(3)Sigma=(" total "4"-momentum "p" on " bar(S))],[=int_(S)T*d^(3)Sigma=(" total 4-momentum "p" on "S)]:}\begin{align*} \int_{\bar{S}} \boldsymbol{T} \cdot & d^{3} \boldsymbol{\Sigma}=(\text { total } 4 \text {-momentum } \boldsymbol{p} \text { on } \bar{S}) \tag{5.31}\\ & =\int_{S} \boldsymbol{T} \cdot d^{3} \boldsymbol{\Sigma}=(\text { total 4-momentum } \boldsymbol{p} \text { on } S) \end{align*}(5.31)S¯Td3Σ=( total 4-momentum p on S¯)=STd3Σ=( total 4-momentum p on S)
This says that observers in different Lorentz frames measure the same total 4-momentum p p p\boldsymbol{p}p. It does not mean that they measure the same components ( p α p α ¯ ) p α p α ¯ (p^(alpha)!=p^( bar(alpha)))\left(p^{\alpha} \neq p^{\bar{\alpha}}\right)(pαpα¯); rather, it means they measure the same geometric vector
p on S = p α e α = p on S ¯ = p α ¯ e α ¯ p on S = p α e α = p on S ¯ = p α ¯ e α ¯ p_(onS)=p^(alpha)e_(alpha)=p_(on bar(S))=p^( bar(alpha))e_( bar(alpha))\boldsymbol{p}_{\mathrm{on} S}=p^{\alpha} \boldsymbol{e}_{\alpha}=\boldsymbol{p}_{\mathrm{on} \bar{S}}=p^{\bar{\alpha}} \boldsymbol{e}_{\bar{\alpha}}ponS=pαeα=ponS¯=pα¯eα¯
a vector whose components are connected by the usual Lorentz transformation law
(5.32) p α = Λ α β ¯ p β ¯ (5.32) p α = Λ α β ¯ p β ¯ {:(5.32)p^(alpha)=Lambda^(alpha)_( bar(beta))p^( bar(beta)):}\begin{equation*} p^{\alpha}=\Lambda^{\alpha}{ }_{\bar{\beta}} p^{\bar{\beta}} \tag{5.32} \end{equation*}(5.32)pα=Λαβ¯pβ¯
Total 4-momentum the same in all Lorentz frames
Total 4-momentum independent of hypersurface where measured
Change with time of 4-momentum in a box equals flux of 4 -momentum across its faces
Differential conservation law for 4-momentum: T = 0 T = 0 grad*T=0\boldsymbol{\nabla} \cdot \boldsymbol{T}=0T=0

Case (d)

Here the contribution to the integral comes entirely from two arbitrary spacelike hypersurfaces, S A S A S_(A)S_{A}SA and S B S B S_(B)S_{B}SB, cutting all the way across spacetime. As in cases (a) and (b), the integral form of the conservation law says
(5.33) p on S A = p on S B ; (5.33) p on  S A = p on  S B ; {:(5.33)p_("on "S_(A))=p_("on "S_(B));:}\begin{equation*} \boldsymbol{p}_{\text {on } S_{A}}=\boldsymbol{p}_{\text {on } S_{B}} ; \tag{5.33} \end{equation*}(5.33)pon SA=pon SB;
i.e., the total 4-momentum on a spacelike slice through spacetime is independent of the specific slice chosen-so long as the energy-momentum flux across the "hypersurface at infinity" connecting S A S A S_(A)S_{A}SA and S B S B S_(B)S_{B}SB is zero.

Case (e)

This case concerns a box whose walls oscillate and accelerate as time passes. The three-dimensional boundary V V delV\partial \mathscr{V}V is made up of (1) the interior S S SSS of the box, at an initial moment of time t = t = t=t=t= constant in the box's initial Lorentz frame, taken with nonstandard orientation; (2) the interior S ¯ S ¯ bar(S)\bar{S}S¯ of the box, at t ¯ = t ¯ = bar(t)=\bar{t}=t¯= constant in its final Lorentz frame, with standard orientation; (3) the 3-volume T T T\mathscr{T}T swept out by the box's two-dimensional faces between the initial and final states, with positive sense oriented outward. The integral conservation law V T d 3 Σ = 0 V T d 3 Σ = 0 int_(del V)T*d^(3)Sigma=0\int_{\partial V} \boldsymbol{T} \cdot d^{3} \boldsymbol{\Sigma}=0VTd3Σ=0 says
( total 4-momentum in box at S ¯ ) ( total 4-momentum in box at S ) (  total 4-momentum   in box at  S ¯ ) (  total 4-momentum   in box at  S ) ((" total 4-momentum ")/(" in box at "( bar(S))))-((" total 4-momentum ")/(" in box at "S))\binom{\text { total 4-momentum }}{\text { in box at } \bar{S}}-\binom{\text { total 4-momentum }}{\text { in box at } S}( total 4-momentum  in box at S¯)( total 4-momentum  in box at S)
(5.34) = ( total 4 -momentum that enters box through its faces between states S and S ¯ ) . (5.34) = (  total  4 -momentum that enters box through   its faces between states  S  and  S ¯ ) . {:(5.34)=((" total "4"-momentum that enters box through ")/(" its faces between states "S" and "( bar(S)))).:}\begin{equation*} =\binom{\text { total } 4 \text {-momentum that enters box through }}{\text { its faces between states } \mathcal{S} \text { and } \bar{S}} . \tag{5.34} \end{equation*}(5.34)=( total 4-momentum that enters box through  its faces between states S and S¯).

§5.9. CONSERVATION OF 4-MOMENTUM: DIFFERENTIAL FORMULATION

Complementary to any "integral conservation law in flat spacetime" is a "differential conservation law" with identical information content. To pass back and forth between them, one can use Gauss's theorem.
Gauss's theorem in four dimensions, applied to the law of 4-momentum conservation, converts the surface integral of T μ α T μ α T^(mu alpha)T^{\mu \alpha}Tμα into a volume integral of T μ α , α T μ α , α T^(mu alpha)_(,alpha)T^{\mu \alpha}{ }_{, \alpha}Tμα,α :
(5.35) 0 = V T μ α d 3 Σ α = V T , α μ α d t d x d y d z (5.35) 0 = V T μ α d 3 Σ α = V T , α μ α d t d x d y d z {:(5.35)0=oint_(del V)T^(mu alpha)d^(3)Sigma_(alpha)=int_(V)T_(,alpha)^(mu alpha)dtdxdydz:}\begin{equation*} 0=\oint_{\partial V} T^{\mu \alpha} d^{3} \Sigma_{\alpha}=\int_{V} T_{, \alpha}^{\mu \alpha} d t d x d y d z \tag{5.35} \end{equation*}(5.35)0=VTμαd3Σα=VT,αμαdtdxdydz
(See Box 5.3 for elementary discussion; Box 5.4 for sophisticated discussion.) If the integral of T μ α , α T μ α , α T^(mu alpha)_(,alpha)T^{\mu \alpha}{ }_{, \alpha}Tμα,α is to vanish, as demanded, for any and every 4 -volume V V V\mathscr{V}V, then T μ α , α T μ α , α T^(mu alpha)_(,alpha)T^{\mu \alpha}{ }_{, \alpha}Tμα,α must itself vanish everywhere in spacetime:
(5.36) T , α μ α = 0 ; i.e., T = 0 everywhere. (5.36) T , α μ α = 0 ;  i.e.,  T = 0  everywhere.  {:(5.36)T_(,alpha)^(mu alpha)=0;" i.e., "grad*T=0" everywhere. ":}\begin{equation*} T_{, \alpha}^{\mu \alpha}=0 ; \text { i.e., } \boldsymbol{\nabla} \cdot \boldsymbol{T}=0 \text { everywhere. } \tag{5.36} \end{equation*}(5.36)T,αμα=0; i.e., T=0 everywhere. 
Box 5.3 VOLUME INTEGRALS, SURFACE INTEGRALS, AND GAUSS'S THEOREM IN COMPONENT NOTATION

A. Volume Integrals in Spacetime

  1. By analogy with three-dimensional space, the volume of a "hyperparallelepiped" with vector edges A , B , C , D A , B , C , D A,B,C,D\boldsymbol{A}, \boldsymbol{B}, \boldsymbol{C}, \boldsymbol{D}A,B,C,D is
4-volume Ω ϵ α β γ δ A α B β C γ D δ = det A 0 A 1 A 2 A 3 B 0 B 1 B 2 B 3 C 0 C 1 C 2 C 3 D 0 D 1 D 2 D 3 = ( A B C D ) .  4-volume  Ω ϵ α β γ δ A α B β C γ D δ = det A 0 A 1 A 2 A 3 B 0 B 1 B 2 B 3 C 0 C 1 C 2 C 3 D 0 D 1 D 2 D 3 = ( A B C D ) . {:[" 4-volume "-=Omega-=epsilon_(alpha beta gamma delta)A^(alpha)B^(beta)C^(gamma)D^(delta)=det||[A^(0),A^(1),A^(2),A^(3)],[B^(0),B^(1),B^(2),B^(3)],[C^(0),C^(1),C^(2),C^(3)],[D^(0),D^(1),D^(2),D^(3)]||],[=^(**)(A^^B^^C^^D).]:}\begin{aligned} \text { 4-volume } & \equiv \Omega \equiv \epsilon_{\alpha \beta \gamma \delta} A^{\alpha} B^{\beta} C^{\gamma} D^{\delta}=\operatorname{det}\left\|\begin{array}{llll} A^{0} & A^{1} & A^{2} & A^{3} \\ B^{0} & B^{1} & B^{2} & B^{3} \\ C^{0} & C^{1} & C^{2} & C^{3} \\ D^{0} & D^{1} & D^{2} & D^{3} \end{array}\right\| \\ & ={ }^{*}(\boldsymbol{A} \wedge \boldsymbol{B} \wedge \boldsymbol{C} \wedge \boldsymbol{D}) . \end{aligned} 4-volume ΩϵαβγδAαBβCγDδ=detA0A1A2A3B0B1B2B3C0C1C2C3D0D1D2D3=(ABCD).
Here, as for 3-volumes, orientation matters; interchange of any two edges reverses the sign of Ω Ω Omega\OmegaΩ. The standard orientation for any 4 -volume is the one which makes Ω Ω Omega\OmegaΩ positive; thus, e 0 e 1 e 2 e 3 e 0 e 1 e 2 e 3 e_(0)^^e_(1)^^e_(2)^^e_(3)\boldsymbol{e}_{0} \wedge \boldsymbol{e}_{1} \wedge \boldsymbol{e}_{2} \wedge \boldsymbol{e}_{3}e0e1e2e3 has standard orientation if e 0 e 0 e_(0)\boldsymbol{e}_{0}e0 points toward the future and e 1 , e 2 , e 3 e 1 , e 2 , e 3 e_(1),e_(2),e_(3)\boldsymbol{e}_{1}, \boldsymbol{e}_{2}, \boldsymbol{e}_{3}e1,e2,e3 are a righthanded triad.
2. The "volume element" whose edges in a specific, standard-oriented Lorentz frame are
A α = ( Δ t , 0 , 0 , 0 ) , B α = ( 0 , Δ x , 0 , 0 ) , C α = ( 0 , 0 , Δ y , 0 ) , D α = ( 0 , 0 , 0 , Δ z ) A α = ( Δ t , 0 , 0 , 0 ) , B α = ( 0 , Δ x , 0 , 0 ) , C α = ( 0 , 0 , Δ y , 0 ) , D α = ( 0 , 0 , 0 , Δ z ) A^(alpha)=(Delta t,0,0,0),B^(alpha)=(0,Delta x,0,0),C^(alpha)=(0,0,Delta y,0),D^(alpha)=(0,0,0,Delta z)A^{\alpha}=(\Delta t, 0,0,0), B^{\alpha}=(0, \Delta x, 0,0), C^{\alpha}=(0,0, \Delta y, 0), D^{\alpha}=(0,0,0, \Delta z)Aα=(Δt,0,0,0),Bα=(0,Δx,0,0),Cα=(0,0,Δy,0),Dα=(0,0,0,Δz)
has a 4 -volume, according to the above definition, given by
Δ 4 Ω = ϵ 0123 Δ t Δ x Δ y Δ z = Δ t Δ x Δ y Δ z . Δ 4 Ω = ϵ 0123 Δ t Δ x Δ y Δ z = Δ t Δ x Δ y Δ z . Delta^(4)Omega=epsilon_(0123)Delta t Delta x Delta y Delta z=Delta t Delta x Delta y Delta z.\Delta^{4} \Omega=\epsilon_{0123} \Delta t \Delta x \Delta y \Delta z=\Delta t \Delta x \Delta y \Delta z .Δ4Ω=ϵ0123ΔtΔxΔyΔz=ΔtΔxΔyΔz.
  1. Thus, the volume integral of a tensor S S S\boldsymbol{S}S over a four-dimensional region V V V\mathscr{V}V of spacetime, defined as
M Lim ( number of elementary volumes ( elementary volumes C in V ) S at center of C ( volume of a ) , M  Lim   number of   elementary   volumes   elementary   volumes  C  in  V S at center of  C (  volume of  a ) , M-=_({:" Lim ":}rarr oo_(([" number of "],[" elementary "],[" volumes "])_(([" elementary "],[" volumes "C],[" in "V]))quadS_("at center of "C)(" volume of "a),)\boldsymbol{M} \equiv \underset{\substack{\text { Lim }} \underset{\left(\begin{array}{c} \text { number of } \\ \text { elementary } \\ \text { volumes } \end{array}\right.}{\rightarrow \infty} \underset{\left(\begin{array}{c} \text { elementary } \\ \text { volumes } \mathscr{C} \\ \text { in } \mathscr{V} \end{array}\right)}{ } \quad \boldsymbol{S}_{\text {at center of } \mathscr{C}}(\text { volume of } \mathscr{a}),}{ }M Lim ( number of  elementary  volumes ( elementary  volumes C in V)Sat center of C( volume of a),
can be calculated in a Lorentz frame by
M α β γ = γ S α β γ d 4 Ω = γ S α β γ d t d x d y d z M α β γ = γ S α β γ d 4 Ω = γ S α β γ d t d x d y d z M^(alpha)_(beta gamma)=int_(gamma)S^(alpha)_(beta gamma)d^(4)Omega=int_(gamma)S^(alpha)_(beta gamma)dtdxdydzM^{\alpha}{ }_{\beta \gamma}=\int_{\gamma} S^{\alpha}{ }_{\beta \gamma} d^{4} \Omega=\int_{\gamma} S^{\alpha}{ }_{\beta \gamma} d t d x d y d zMαβγ=γSαβγd4Ω=γSαβγdtdxdydz

Box 5.3 (continued)

B. Integrals over 3-Surfaces in Spacetime

  1. Introduce arbitrary coordinates a , b , c a , b , c a,b,ca, b, ca,b,c on the three-dimensional surface. The elementary volume bounded by coordinate surfaces
    a 0 < a < a 0 + Δ a , b 0 < b < b 0 + Δ b a 0 < a < a 0 + Δ a , b 0 < b < b 0 + Δ b a_(0) < a < a_(0)+Delta a,quadb_(0) < b < b_(0)+Delta ba_{0}<a<a_{0}+\Delta a, \quad b_{0}<b<b_{0}+\Delta ba0<a<a0+Δa,b0<b<b0+Δb, c 0 < c < c 0 + Δ c c 0 < c < c 0 + Δ c c_(0) < c < c_(0)+Delta cc_{0}<c<c_{0}+\Delta cc0<c<c0+Δc
    has edges
A α = x α a Δ a , B β = x β b Δ b , C γ = x γ c Δ c ; A α = x α a Δ a , B β = x β b Δ b , C γ = x γ c Δ c ; A^(alpha)=(delx^(alpha))/(del a)Delta a,B^(beta)=(delx^(beta))/(del b)Delta b,C^(gamma)=(delx^(gamma))/(del c)Delta c;A^{\alpha}=\frac{\partial x^{\alpha}}{\partial a} \Delta a, B^{\beta}=\frac{\partial x^{\beta}}{\partial b} \Delta b, C^{\gamma}=\frac{\partial x^{\gamma}}{\partial c} \Delta c ;Aα=xαaΔa,Bβ=xβbΔb,Cγ=xγcΔc;

so its volume 1-form is
Δ 3 Σ μ = ϵ μ α β γ x α a x β b x γ c Δ a Δ b Δ c . Δ 3 Σ μ = ϵ μ α β γ x α a x β b x γ c Δ a Δ b Δ c . Delta^(3)Sigma_(mu)=epsilon_(mu alpha beta gamma)(delx^(alpha))/(del a)(delx^(beta))/(del b)(delx^(gamma))/(del c)Delta a Delta b Delta c.\Delta^{3} \Sigma_{\mu}=\epsilon_{\mu \alpha \beta \gamma} \frac{\partial x^{\alpha}}{\partial a} \frac{\partial x^{\beta}}{\partial b} \frac{\partial x^{\gamma}}{\partial c} \Delta a \Delta b \Delta c .Δ3Σμ=ϵμαβγxαaxβbxγcΔaΔbΔc.
  1. The integral of a tensor S S S\boldsymbol{S}S over the 3 -surface S S SSS thus has components
N α β = S S α β γ 3 Σ γ = S S α β γ ϵ γ μ ν λ x μ a x ν b x λ c d a d b d c . N α β = S S α β γ 3 Σ γ = S S α β γ ϵ γ μ ν λ x μ a x ν b x λ c d a d b d c . N^(alpha)_(beta)=int_(S)S^(alpha)_(beta)gamma^(3)Sigma_(gamma)=int_(S)S^(alpha)_(beta)gamma_(epsilon_(gamma mu nu lambda))(delx^(mu))/(del a)(delx^(nu))/(del b)(delx^(lambda))/(del c)dadbdc.N^{\alpha}{ }_{\beta}=\int_{S} S^{\alpha}{ }_{\beta} \gamma^{3} \Sigma_{\gamma}=\int_{S} S^{\alpha}{ }_{\beta} \gamma_{\epsilon_{\gamma \mu \nu \lambda}} \frac{\partial x^{\mu}}{\partial a} \frac{\partial x^{\nu}}{\partial b} \frac{\partial x^{\lambda}}{\partial c} d a d b d c .Nαβ=SSαβγ3Σγ=SSαβγϵγμνλxμaxνbxλcdadbdc.
An equivalent formula involving a Jacobian is often used (see exercise 5.5):
N α β = S S α β γ 1 3 ! ϵ γ μ ν λ ( x μ , x ν , x λ ) ( a , b , c ) d a d b d c . N α β = S S α β γ 1 3 ! ϵ γ μ ν λ x μ , x ν , x λ ( a , b , c ) d a d b d c . N^(alpha)_(beta)=int_(S)S^(alpha)_(beta)^(gamma)(1)/(3!)epsilon_(gamma mu nu lambda)(del(x^(mu),x^(nu),x^(lambda)))/(del(a,b,c))dadbdc.N^{\alpha}{ }_{\beta}=\int_{S} S^{\alpha}{ }_{\beta}{ }^{\gamma} \frac{1}{3!} \epsilon_{\gamma \mu \nu \lambda} \frac{\partial\left(x^{\mu}, x^{\nu}, x^{\lambda}\right)}{\partial(a, b, c)} d a d b d c .Nαβ=SSαβγ13!ϵγμνλ(xμ,xν,xλ)(a,b,c)dadbdc.

C. Gauss's Theorem Stated

  1. Consider a bounded four-dimensional region of spacetime V V V\mathscr{V}V with closed boundary V V delV\partial \mathscr{V}V. Orient the volume 1-forms on V V delV\partial \mathscr{V}V so that the "positive sense" is away from V V V\mathscr{V}V.
  2. Choose a tensor field S S S\boldsymbol{S}S. Integrate its divergence over V V V\mathscr{V}V, and integrate it itself over V V delV\partial \mathscr{V}V. The results must be the same (Gauss's theorem):
V S α β γ , γ 4 d 4 Ω = V S α β γ d 3 Σ γ . V S α β γ , γ 4 d 4 Ω = V S α β γ d 3 Σ γ . int_(V)S^(alpha)_(beta)^(gamma),gamma^(4)d^(4)Omega=oint_(del V)S^(alpha)_(beta)^(gamma)d^(3)Sigma_(gamma).\int_{V} S^{\alpha}{ }_{\beta}{ }^{\gamma}, \gamma{ }^{4} d^{4} \Omega=\oint_{\partial V} S^{\alpha}{ }_{\beta}{ }^{\gamma} d^{3} \Sigma_{\gamma} .VSαβγ,γ4d4Ω=VSαβγd3Σγ.

D. Proof of Gauss's Theorem

  1. The indices α α alpha\alphaα and β β beta\betaβ of S α β γ S α β γ S^(alpha)_(beta)gammaS^{\alpha}{ }_{\beta} \gammaSαβγ "go along for a free ride," so one can suppress them from the proof. Then the equation to be derived is
γ ~ S γ , γ d t d x d y d z = γ S γ d 3 Σ γ . γ ~ S γ , γ d t d x d y d z = γ S γ d 3 Σ γ . int_( widetilde(gamma))S^(gamma)_(,gamma)dtdxdydz=oint_(del gamma)S^(gamma)d^(3)Sigma_(gamma).\int_{\widetilde{\gamma}} S^{\gamma}{ }_{, \gamma} d t d x d y d z=\oint_{\partial \gamma} S^{\gamma} d^{3} \Sigma_{\gamma} .γ~Sγ,γdtdxdydz=γSγd3Σγ.
  1. Since the integral of a derivative is just the original function, the volume integral of S 0 0 S 0 0 S^(0)_(0)S^{0}{ }_{0}S00 is
V S 0 , 0 d t d x d y d z = "up" S 0 d x d y d z ("down" S 0 d x d y d z V S 0 , 0 d t d x d y d z = "up"  S 0 d x d y d z ("down"  S 0 d x d y d z {:[int_(V)S^(0)_(,0)dtdxdydz],[=int_(""up" ")S^(0)dxdydz-int_((down )S^(0)dxdydz]:}\begin{aligned} & \int_{V} S^{0}{ }_{, 0} d t d x d y d z \\ & =\int_{\text {"up" }} S^{0} d x d y d z-\int_{\text {("down" }} S^{0} d x d y d z \end{aligned}VS0,0dtdxdydz="up" S0dxdydz("down" S0dxdydz

3. The surface integral τ S 0 d 3 Σ 0 τ S 0 d 3 Σ 0 int_(del tau)S^(0)d^(3)Sigma_(0)\int_{\partial \tau} S^{0} d^{3} \Sigma_{0}τS0d3Σ0 can be reduced to the same set of terms:
a. Use x , y , z x , y , z x,y,zx, y, zx,y,z as coordinates on V V delV\partial \mathscr{V}V. On the "up" side, d 3 Σ 0 d 3 Σ 0 d^(3)Sigma_(0)d^{3} \Sigma_{0}d3Σ0 must be positive to achieve a "positive" sense pointing away from V V V\mathscr{V}V, so (see part B above)
d 3 Σ 0 = ϵ 0 α β γ x α x x β y x γ z d x d y d z = ϵ 0123 d x d y d z = d x d y d z d 3 Σ 0 = ϵ 0 α β γ x α x x β y x γ z d x d y d z = ϵ 0123 d x d y d z = d x d y d z d^(3)Sigma_(0)=epsilon_(0alpha beta gamma)(delx^(alpha))/(del x)(delx^(beta))/(del y)(delx^(gamma))/(del z)dxdydz=epsilon_(0123)dxdydz=dxdydzd^{3} \Sigma_{0}=\epsilon_{0 \alpha \beta \gamma} \frac{\partial x^{\alpha}}{\partial x} \frac{\partial x^{\beta}}{\partial y} \frac{\partial x^{\gamma}}{\partial z} d x d y d z=\epsilon_{0123} d x d y d z=d x d y d zd3Σ0=ϵ0αβγxαxxβyxγzdxdydz=ϵ0123dxdydz=dxdydz
b. On the "down" side, d 3 Σ 0 d 3 Σ 0 d^(3)Sigma_(0)d^{3} \Sigma_{0}d3Σ0 must be negative, so
d 3 Σ 0 = d x d y d z d 3 Σ 0 = d x d y d z d^(3)Sigma_(0)=-dxdydzd^{3} \Sigma_{0}=-d x d y d zd3Σ0=dxdydz
c. Hence,
r S 0 d 3 Σ 0 = "up" S 0 d x d y d z "down" S 0 d x d y d z r S 0 d 3 Σ 0 = "up"  S 0 d x d y d z "down"  S 0 d x d y d z int_(del r)S^(0)d^(3)Sigma_(0)=int_(""up" ")S^(0)dxdydz-int_(""down" ")S^(0)dxdydz\int_{\partial r} S^{0} d^{3} \Sigma_{0}=\int_{\text {"up" }} S^{0} d x d y d z-\int_{\text {"down" }} S^{0} d x d y d zrS0d3Σ0="up" S0dxdydz"down" S0dxdydz
  1. Equality is proved for the other components in the same manner. Adding components produces the result desired:
γ S γ , γ d 4 Ω = γ S γ d 3 Σ γ γ S γ , γ d 4 Ω = γ S γ d 3 Σ γ int_(gamma)S^(gamma)_(,gamma)d^(4)Omega=oint_(del gamma)S^(gamma)d^(3)Sigma_(gamma)\int_{\gamma} S^{\gamma}{ }_{, \gamma} d^{4} \Omega=\oint_{\partial \gamma} S^{\gamma} d^{3} \Sigma_{\gamma}γSγ,γd4Ω=γSγd3Σγ

FOR THE READER WHO HAS STUDIED CHAPTER 4

Box 5.4 I. EVERY INTEGRAL IS THE INTEGRAL OF A FORM.
II. THE THEOREM OF GAUSS IN THE LANGUAGE OF FORMS.
I. Every integral encountered in Chapter 5 can be interpreted as the integral of an exterior differential form. This circumstance shows up in fourtold and threctold integrals, for example, in the fact that
d 4 Ω = ε = 1 = ε 0123 d t d x d y d z d 4 Ω = ε = 1 = ε 0123 d t d x d y d z d^(4)Omega=epsi=^(**)1=epsi_(0123)dt^^dx^^dy^^dzd^{4} \Omega=\varepsilon={ }^{*} 1=\varepsilon_{0123} \boldsymbol{d} t \wedge \boldsymbol{d} x \wedge \boldsymbol{d} y \wedge \boldsymbol{d} zd4Ω=ε=1=ε0123dtdxdydz
and
d 3 Σ μ = ε μ | α β γ | d x α d x β d x γ d 3 Σ μ = ε μ | α β γ | d x α d x β d x γ d^(3)Sigma_(mu)=epsi_(mu|alpha beta gamma|)dx^(alpha)^^dx^(beta)^^dx^(gamma)d^{3} \Sigma_{\mu}=\varepsilon_{\mu|\alpha \beta \gamma|} \boldsymbol{d} x^{\alpha} \wedge \boldsymbol{d} x^{\beta} \wedge \boldsymbol{d} x^{\gamma}d3Σμ=εμ|αβγ|dxαdxβdxγ
are basis 4- and 3-forms. (Recall: the indices α β γ α β γ alpha beta gamma\alpha \beta \gammaαβγ between vertical bars are to be summed only over 0 α < β < γ 3 0 α < β < γ 3 0 <= alpha < beta < gamma <= 30 \leq \alpha<\beta<\gamma \leq 30α<β<γ3.) A more extensive glossary of notations is found in C below.
II. Gauss's Theorem for a tensor integral in flat space reads
V ( S ) d 4 Ω = V S d Σ V ( S ) d 4 Ω = V S d Σ int_(V)(grad*S)d^(4)Omega=oint_(del V)S*d Sigma\int_{\mathscr{V}}(\boldsymbol{\nabla} \cdot \boldsymbol{S}) d^{4} \Omega=\oint_{\partial V} \boldsymbol{S} \cdot d \boldsymbol{\Sigma}V(S)d4Ω=VSdΣ
for any tensor, such as S = S α β γ e α ω β e γ S = S α β γ e α ω β e γ S=S^(alpha)_(beta)^(gamma)e_(alpha)oxomega^(beta)oxe_(gamma)\boldsymbol{S}=S^{\alpha}{ }_{\beta}{ }^{\gamma} \boldsymbol{e}_{\alpha} \otimes \boldsymbol{\omega}^{\beta} \otimes \boldsymbol{e}_{\gamma}S=Sαβγeαωβeγ (see Box 5.3 for component form). It is an application of the generalized Stokes Theorem (Box 4.1), and depends on the fact that the basis vectors e α e α e_(alpha)\boldsymbol{e}_{\alpha}eα and ω β ω β omega^(beta)\boldsymbol{\omega}^{\beta}ωβ of a global Lorentz frame are constants, i.e., are independent of x x xxx. The definitions follow in A ; the proof is in B .
A. Tensor-valued integrals can be defined in flat spaces because one uses constant basis vectors. Thus one defines
S d 3 Σ = e α ω β S β α d 3 Σ γ S d 3 Σ = e α ω β S β α d 3 Σ γ int S*d^(3)Sigma=e_(alpha)oxomega^(beta)intS_(beta)^(alpha)d^(3)Sigma_(gamma)\int \boldsymbol{S} \cdot d^{3} \boldsymbol{\Sigma}=\boldsymbol{e}_{\alpha} \otimes \boldsymbol{\omega}^{\beta} \int S_{\beta}^{\alpha} d^{3} \Sigma_{\gamma}Sd3Σ=eαωβSβαd3Σγ
for a tensor of the indicated rank. One justifies pulling basis vectors and forms outside the integral sign because they are constants, independent of location in spacetime. Each of the numbers S α β γ d 3 Σ γ S α β γ d 3 Σ γ intS^(alpha)_(beta)^(gamma)d^(3)Sigma_(gamma)\int S^{\alpha}{ }_{\beta}{ }^{\gamma} d^{3} \Sigma_{\gamma}Sαβγd3Σγ (for α , β = 0 , 1 , 2 , 3 α , β = 0 , 1 , 2 , 3 alpha,beta=0,1,2,3\alpha, \beta=0,1,2,3α,β=0,1,2,3 ) is then evaluated by substituting any properly oriented parametrization of the hypersurface into the 3 -form S α β γ d 3 Σ γ S α β γ d 3 Σ γ S^(alpha)_(beta)^(gamma)d^(3)Sigma_(gamma)S^{\alpha}{ }_{\beta}{ }^{\gamma} d^{3} \Sigma_{\gamma}Sαβγd3Σγ as described in Box 4.1 (arbitrary curvilinear parametrization in the part of the calculation not involving the "free indices" α α alpha\alphaα and β β beta\betaβ ). In other words, S d 3 Σ = e α ω β S α β γ d 3 Σ γ S d 3 Σ = e α ω β S α β γ d 3 Σ γ S*d^(3)Sigma=e_(alpha)oxomega^(beta)oxS^(alpha)_(beta)^(gamma)d^(3)Sigma_(gamma)\boldsymbol{S} \cdot d^{3} \boldsymbol{\Sigma}=\boldsymbol{e}_{\alpha} \otimes \boldsymbol{\omega}^{\beta} \otimes S^{\alpha}{ }_{\beta}{ }^{\gamma} d^{3} \Sigma_{\gamma}Sd3Σ=eαωβSαβγd3Σγ is considered a "tensor-valued 3-form." Under an integral sign, it is contracted with the hyperplane element tangent to the 3 -surface P ( λ 1 , λ 2 , λ 3 ) P λ 1 , λ 2 , λ 3 P(lambda^(1),lambda^(2),lambda^(3))\mathscr{P}\left(\lambda^{1}, \lambda^{2}, \lambda^{3}\right)P(λ1,λ2,λ3) of integration to form the integral
S d 3 Σ = S d 3 Σ , P λ 1 P λ 2 P λ 3 d λ 1 d λ 2 d λ 3 = e α ω β S α β γ ε γ | λ μ ν | ( x λ , x μ , x ν ) ( λ 1 , λ 2 , λ 3 ) Jacobian determinant d λ 1 d λ 2 d λ 3 . S d 3 Σ = S d 3 Σ , P λ 1 P λ 2 P λ 3 d λ 1 d λ 2 d λ 3 = e α ω β S α β γ ε γ | λ μ ν | x λ , x μ , x ν λ 1 , λ 2 , λ 3  Jacobian   determinant  d λ 1 d λ 2 d λ 3 . {:[int S*d^(3)Sigma=int(:S*d^(3)Sigma,(delP)/(dellambda^(1))^^(delP)/(dellambda^(2))^^(delP)/(dellambda^(3)):)dlambda^(1)dlambda^(2)dlambda^(3)],[=e_(alpha)oxomega^(beta)intS^(alpha)_(beta)gamma_(epsi_(gamma|lambda mu nu|))ubrace((del(x^(lambda),x^(mu),x^(nu)))/(del(lambda^(1),lambda^(2),lambda^(3)))ubrace)_({:[" Jacobian "],[" determinant "]:})dlambda^(1)dlambda^(2)dlambda^(3).]:}\begin{aligned} \int \boldsymbol{S} \cdot d^{3} \boldsymbol{\Sigma} & =\int\left\langle\boldsymbol{S} \cdot d^{3} \boldsymbol{\Sigma}, \frac{\partial \mathscr{P}}{\partial \lambda^{1}} \wedge \frac{\partial \mathscr{P}}{\partial \lambda^{2}} \wedge \frac{\partial \mathscr{P}}{\partial \lambda^{3}}\right\rangle d \lambda^{1} d \lambda^{2} d \lambda^{3} \\ & =\boldsymbol{e}_{\alpha} \otimes \boldsymbol{\omega}^{\beta} \int \boldsymbol{S}^{\alpha}{ }_{\beta} \gamma_{\varepsilon_{\gamma|\lambda \mu \nu|}} \underbrace{\frac{\partial\left(x^{\lambda}, x^{\mu}, x^{\nu}\right)}{\partial\left(\lambda^{1}, \lambda^{2}, \lambda^{3}\right)}}_{\begin{array}{c} \text { Jacobian } \\ \text { determinant } \end{array}} d \lambda^{1} d \lambda^{2} d \lambda^{3} . \end{aligned}Sd3Σ=Sd3Σ,Pλ1Pλ2Pλ3dλ1dλ2dλ3=eαωβSαβγεγ|λμν|(xλ,xμ,xν)(λ1,λ2,λ3) Jacobian  determinant dλ1dλ2dλ3.
Although constant basis vectors e α , ω β e α , ω β e_(alpha),omega^(beta)\boldsymbol{e}_{\alpha}, \boldsymbol{\omega}^{\beta}eα,ωβ derived from rectangular coordinates are essential here, a completely general parametrization of the hypersurface may be used.
B. The proof of Gauss's Theorem is a computation:
γ s d 3 Σ = e α ω β V S α β γ d 3 Σ γ ( e α , ω β are constant) = e α ω β γ d ( S α β γ d 3 Σ γ ) (Stokes Theorem) = e α ω β γ S α β γ , 1 (see below) = γ ( S ) d 4 Ω . (merely notation) γ s d 3 Σ = e α ω β V S α β γ d 3 Σ γ e α , ω β  are constant)  = e α ω β γ d S α β γ d 3 Σ γ  (Stokes Theorem)  = e α ω β γ S α β γ , 1  (see below)  = γ ( S ) d 4 Ω .  (merely notation)  {:[oint_(del gamma)s*d^(3)Sigma=e_(alpha)oxomega^(beta)oint_(del V)S^(alpha)_(beta)^(gamma)d^(3)Sigma_(gamma)(e_(alpha),omega^(beta):}" are constant) "],[=e_(alpha)oxomega^(beta)int_(gamma)d(S^(alpha)_(beta)^(gamma)d^(3)Sigma_(gamma))" (Stokes Theorem) "],[=e_(alpha)oxomega^(beta)int_(gamma)S^(alpha)_(beta)^(gamma)","^(')^(**)1" (see below) "],[=int_(gamma)(grad*S)d^(4)Omega." (merely notation) "]:}\begin{aligned} \oint_{\partial \gamma} \boldsymbol{s} \cdot d^{3} \boldsymbol{\Sigma} & =\boldsymbol{e}_{\alpha} \otimes \boldsymbol{\omega}^{\beta} \oint_{\partial V} S^{\alpha}{ }_{\beta}{ }^{\gamma} d^{3} \Sigma_{\gamma} & \left(\boldsymbol{e}_{\alpha}, \boldsymbol{\omega}^{\beta}\right. \text { are constant) } \\ & =\boldsymbol{e}_{\alpha} \otimes \boldsymbol{\omega}^{\beta} \int_{\gamma} \boldsymbol{d}\left(S^{\alpha}{ }_{\beta}{ }^{\gamma} d^{3} \Sigma_{\gamma}\right) & \text { (Stokes Theorem) } \\ & =\boldsymbol{e}_{\alpha} \otimes \boldsymbol{\omega}^{\beta} \int_{\gamma} S^{\alpha}{ }_{\beta}{ }^{\gamma},{ }^{\prime}{ }^{*} 1 & \text { (see below) } \\ & =\int_{\gamma}(\boldsymbol{\nabla} \cdot \boldsymbol{S}) d^{4} \Omega . & \text { (merely notation) } \end{aligned}γsd3Σ=eαωβVSαβγd3Σγ(eα,ωβ are constant) =eαωβγd(Sαβγd3Σγ) (Stokes Theorem) =eαωβγSαβγ,1 (see below) =γ(S)d4Ω. (merely notation) 
The missing computational step above is
d ( S α β γ d 3 Σ γ ) = ( S α β γ / x ρ ) d x ρ d 3 Σ γ = ( S α β γ / x γ ) 1 . d S α β γ d 3 Σ γ = S α β γ / x ρ d x ρ d 3 Σ γ = S α β γ / x γ 1 . {:[d(S^(alpha)_(beta)^(gamma)d^(3)Sigma_(gamma))=(delS^(alpha)_(beta)^(gamma)//delx^(rho))dx^(rho)^^d^(3)Sigma_(gamma)],[=(delS^(alpha)_(beta)^(gamma)//delx^(gamma))^(**)1.]:}\begin{aligned} \boldsymbol{d}\left(S^{\alpha}{ }_{\beta}{ }^{\gamma} d^{3} \Sigma_{\gamma}\right) & =\left(\partial S^{\alpha}{ }_{\beta}{ }^{\gamma} / \partial x^{\rho}\right) \boldsymbol{d} x^{\rho} \wedge d^{3} \Sigma_{\gamma} \\ & =\left(\partial S^{\alpha}{ }_{\beta}{ }^{\gamma} / \partial x^{\gamma}\right)^{*} 1 . \end{aligned}d(Sαβγd3Σγ)=(Sαβγ/xρ)dxρd3Σγ=(Sαβγ/xγ)1.
Here the first step uses d ( d 3 Σ γ ) = 0 d d 3 Σ γ = 0 d(d^(3)Sigma_(gamma))=0\boldsymbol{d}\left(d^{3} \Sigma_{\gamma}\right)=0d(d3Σγ)=0 (which follows from ε μ α β γ = ε μ α β γ = epsi_(mu alpha beta gamma)=\varepsilon_{\mu \alpha \beta \gamma}=εμαβγ= const in flat spacetime). The second step uses
d x ρ d 3 Σ γ = δ γ ρ 1 d x ρ d 3 Σ γ = δ γ ρ 1 dx^(rho)^^d^(3)Sigma_(gamma)=delta_(gamma)^(rho)**1\boldsymbol{d} x^{\rho} \wedge d^{3} \Sigma_{\gamma}=\delta_{\gamma}^{\rho} * 1dxρd3Σγ=δγρ1
[Write the lefthand side of this identity as ε γ | μ ν λ | d x ρ d x μ d x ν d x λ ε γ | μ ν λ | d x ρ d x μ d x ν d x λ epsi_(gamma|mu nu lambda|)dx^(rho)^^dx^(mu)^^dx^(nu)^^dx^(lambda)\varepsilon_{\gamma|\mu \nu \lambda|} \boldsymbol{d} x^{\rho} \wedge \boldsymbol{d} x^{\mu} \wedge \boldsymbol{d} x^{\nu} \wedge \boldsymbol{d} x^{\lambda}εγ|μνλ|dxρdxμdxνdxλ. The only possible non-zero term in the sum over μ ν λ μ ν λ mu nu lambda\mu \nu \lambdaμνλ is the one with μ < ν < λ μ < ν < λ mu < nu < lambda\mu<\nu<\lambdaμ<ν<λ all different from ρ ρ rho\rhoρ. The righthand side is the value of this term.]
C. Glossary of notations.
Charge density 3-form:
J = J μ d 3 Σ μ = J d 3 Σ = J μ ε μ α β γ ( J ) α β γ d 3 Σ μ d x α d x β d x γ / 3 ! J = J μ d 3 Σ μ = J d 3 Σ = J μ ε μ α β γ J α β γ d 3 Σ μ d x α d x β d x γ / 3 ! {:[**J=J^(mu)d^(3)Sigma_(mu)=J*d^(3)Sigma],[=ubrace(J^(mu)epsi_(mu alpha beta gamma)ubrace)_((^(**)J)_(alpha beta gamma))ubrace(ubrace)_(d^(3)Sigma_(mu))dx^(alpha)^^dx^(beta)^^dx^(gamma)//3!]:}\begin{aligned} * \boldsymbol{J}=J^{\mu} d^{3} \Sigma_{\mu} & =\boldsymbol{J} \cdot d^{3} \boldsymbol{\Sigma} \\ & =\underbrace{J^{\mu} \varepsilon_{\mu \alpha \beta \gamma}}_{\left({ }^{*} J\right)_{\alpha \beta \gamma}} \underbrace{}_{d^{3} \Sigma_{\mu}} \boldsymbol{d} x^{\alpha} \wedge \boldsymbol{d} x^{\beta} \wedge \boldsymbol{d} x^{\gamma} / 3! \end{aligned}J=Jμd3Σμ=Jd3Σ=Jμεμαβγ(J)αβγd3Σμdxαdxβdxγ/3!
Maxwell and Faraday 2-forms:
F = 1 2 F μ ν d 2 S μ ν F = 1 2 F μ ν d x μ d x ν . F = 1 2 F μ ν d 2 S μ ν F = 1 2 F μ ν d x μ d x ν . {:[**F=(1)/(2)F^(mu nu)d^(2)S_(mu nu)],[F=(1)/(2)F_(mu nu)dx^(mu)^^dx^(nu).]:}\begin{aligned} * \boldsymbol{F} & =\frac{1}{2} F^{\mu \nu} d^{2} S_{\mu \nu} \\ \boldsymbol{F} & =\frac{1}{2} F_{\mu \nu} \boldsymbol{d} x^{\mu} \wedge \boldsymbol{d} x^{\nu} . \end{aligned}F=12Fμνd2SμνF=12Fμνdxμdxν.
Basis 2-forms:
d x α d x β ; (one way to label) d 2 S μ ν = ε μ ν | α β | d x α d x β . (dual way to label) d x α d x β ;  (one way to label)  d 2 S μ ν = ε μ ν | α β | d x α d x β .  (dual way to label)  {:[dx^(alpha)^^dx^(beta);," (one way to label) "],[d^(2)S_(mu nu)=epsi_(mu nu|alpha beta|)dx^(alpha)^^dx^(beta).," (dual way to label) "]:}\begin{array}{lr} \boldsymbol{d} x^{\alpha} \wedge \boldsymbol{d} x^{\beta} ; & \text { (one way to label) } \\ d^{2} S_{\mu \nu}=\varepsilon_{\mu \nu|\alpha \beta|} \boldsymbol{d} x^{\alpha} \wedge \boldsymbol{d} x^{\beta} . & \text { (dual way to label) } \end{array}dxαdxβ; (one way to label) d2Sμν=εμν|αβ|dxαdxβ. (dual way to label) 
Energy-momentum density 3-form:
dual on last index, ( T ) μ α β γ = T 3 T μ ν ε ν α β γ e μ T μ ν d 3 Σ ν T ;  dual on last index,  T μ α β γ = T 3 T μ ν ε ν α β γ e μ T μ ν d 3 Σ ν T ; obrace(" dual on last index, "(^(**)T)^(mu)_(alpha beta gamma)=T^(3)T^(mu nu)epsi_(nu alpha beta gamma))^(-=e_(mu)T^(mu nu)d^(3)Sigma_(nu)-=^(**)T;)\overbrace{\text { dual on last index, }\left({ }^{*} T\right)^{\mu}{ }_{\alpha \beta \gamma}=T^{3} T^{\mu \nu} \varepsilon_{\nu \alpha \beta \gamma}}^{\equiv \boldsymbol{e}_{\mu} T^{\mu \nu} d^{3} \Sigma_{\nu} \equiv{ }^{*} \boldsymbol{T} ;} dual on last index, (T)μαβγ=T3TμνεναβγeμTμνd3ΣνT;
Angular momentum density 3-form:
J d 3 Σ 1 2 e μ e ν g μ ν α d 3 Σ α g ( g ) μ ν α β γ = g μ ν λ ε λ α β γ . J d 3 Σ 1 2 e μ e ν g μ ν α d 3 Σ α g g μ ν α β γ = g μ ν λ ε λ α β γ . {:[J*d^(3)Sigma-=(1)/(2)e_(mu)^^e_(nu)g^(mu nu alpha)d^(3)Sigma_(alpha)-=^(**)g],[(^(**)g)^(mu nu)_(alpha beta gamma)=g^(mu nu lambda)epsi_(lambda alpha beta gamma).]:}\begin{aligned} \mathscr{J} \cdot d^{3} \boldsymbol{\Sigma} \equiv \frac{1}{2} \boldsymbol{e}_{\mu} \wedge \boldsymbol{e}_{\nu} \mathscr{g}^{\mu \nu \alpha} d^{3} \Sigma_{\alpha} & \equiv{ }^{*} \boldsymbol{\mathscr { g }} \\ \left({ }^{*} \mathfrak{g}\right)^{\mu \nu}{ }_{\alpha \beta \gamma} & =\mathcal{g}^{\mu \nu \lambda} \varepsilon_{\lambda \alpha \beta \gamma} . \end{aligned}Jd3Σ12eμeνgμναd3Σαg(g)μναβγ=gμνλελαβγ.
(In the frame-independent equation T = 0 T = 0 grad*T=0\boldsymbol{\nabla} \cdot \boldsymbol{T}=0T=0, one need not worry about which slot of T T T\boldsymbol{T}T to take the divergence on; the slots are symmetric, so either can be used.)
The equation T = 0 T = 0 grad*T=0\boldsymbol{\nabla} \cdot \boldsymbol{T}=0T=0 is the differential formulation of the law of 4-momentum conservation. It is also called the equation of motion for stress-energy, because it places constraints on the dynamic evolution of the stress-energy tensor. To examine these constraints for simple systems is to realize the beauty and power of the equation T = 0 T = 0 grad*T=0\boldsymbol{\nabla} \cdot \boldsymbol{T}=0T=0.

§5.10. SAMPLE APPLICATIONS OF T = 0 T = 0 grad*T=0\boldsymbol{\nabla} \cdot \boldsymbol{T}=\mathbf{0}T=0

The equation of motion T = 0 T = 0 grad*T=0\boldsymbol{\nabla} \cdot \boldsymbol{T}=0T=0 makes contact with the classical (Newtonian) equations of hydrodynamics, when applied to a nearly Newtonian fluid. Such a fluid has low velocities relative to the Lorentz frame used, | v j | 1 v j 1 |v^(j)|≪1\left|v^{j}\right| \ll 1|vj|1; and in its rest frame its pressure is small compared to its density of mass-energy, p / ρ = p / ρ c 2 1 p / ρ = p / ρ c 2 1 p//rho=p//rhoc^(2)≪1p / \rho=p / \rho c^{2} \ll 1p/ρ=p/ρc21. For example, the air in a hurricane has
| v j | 100 km / hour 3 , 000 cm / sec 10 7 c = 10 7 1 , p ρ 1 atmosphere 10 3 g / cm 3 10 6 dynes / cm 2 10 3 g / cm 3 = 10 9 cm 2 sec 2 10 12 c 2 = 10 12 1 . v j 100 km /  hour  3 , 000 cm / sec 10 7 c = 10 7 1 , p ρ 1  atmosphere  10 3 g / cm 3 10 6  dynes  / cm 2 10 3 g / cm 3 = 10 9 cm 2 sec 2 10 12 c 2 = 10 12 1 . {:[|v^(j)|∼100km//" hour "∼3","000cm//sec∼10^(-7)c=10^(-7)≪1","],[(p)/( rho)∼(1" atmosphere ")/(10^(-3)(g)//cm^(3))∼(10^(6)" dynes "//cm^(2))/(10^(-3)(g)//cm^(3))=10^(9)(cm^(2))/(sec^(2))∼10^(-12)c^(2)=10^(-12)≪1.]:}\begin{gathered} \left|v^{j}\right| \sim 100 \mathrm{~km} / \text { hour } \sim 3,000 \mathrm{~cm} / \mathrm{sec} \sim 10^{-7} c=10^{-7} \ll 1, \\ \frac{p}{\rho} \sim \frac{1 \text { atmosphere }}{10^{-3} \mathrm{~g} / \mathrm{cm}^{3}} \sim \frac{10^{6} \text { dynes } / \mathrm{cm}^{2}}{10^{-3} \mathrm{~g} / \mathrm{cm}^{3}}=10^{9} \frac{\mathrm{~cm}^{2}}{\mathrm{sec}^{2}} \sim 10^{-12} c^{2}=10^{-12} \ll 1 . \end{gathered}|vj|100 km/ hour 3,000 cm/sec107c=1071,pρ1 atmosphere 103 g/cm3106 dynes /cm2103 g/cm3=109 cm2sec21012c2=10121.
The stress-energy tensor for such a fluid has components
(5.37a) T 00 = ( ρ + p ) u 0 u 0 p ρ , (5.37b) T 0 j = T j 0 = ( ρ + p ) u 0 u j ρ v j , (5.37c) T j k = ( ρ + p ) u j u k + p δ j k ρ v j v k + p δ j k ; (5.37a) T 00 = ( ρ + p ) u 0 u 0 p ρ , (5.37b) T 0 j = T j 0 = ( ρ + p ) u 0 u j ρ v j , (5.37c) T j k = ( ρ + p ) u j u k + p δ j k ρ v j v k + p δ j k ; {:[(5.37a)T^(00)=(rho+p)u^(0)u^(0)-p~~rho","],[(5.37b)T^(0j)=T^(j0)=(rho+p)u^(0)u^(j)~~rhov^(j)","],[(5.37c)T^(jk)=(rho+p)u^(j)u^(k)+pdelta^(jk)~~rhov^(j)v^(k)+pdelta^(jk);]:}\begin{align*} T^{00} & =(\rho+p) u^{0} u^{0}-p \approx \rho, \tag{5.37a}\\ T^{0 j} & =T^{j 0}=(\rho+p) u^{0} u^{j} \approx \rho v^{j}, \tag{5.37b}\\ T^{j k} & =(\rho+p) u^{j} u^{k}+p \delta^{j k} \approx \rho v^{j} v^{k}+p \delta^{j k} ; \tag{5.37c} \end{align*}(5.37a)T00=(ρ+p)u0u0pρ,(5.37b)T0j=Tj0=(ρ+p)u0ujρvj,(5.37c)Tjk=(ρ+p)ujuk+pδjkρvjvk+pδjk;
and the equation of motion T = 0 T = 0 grad*T=0\boldsymbol{\nabla} \cdot \boldsymbol{T}=0T=0 has components
(5.38a) T 00 , 0 + T 0 j , j = ρ / t + ( ρ v ) = 0 (5.38a) T 00 , 0 + T 0 j , j = ρ / t + ( ρ v ) = 0 {:(5.38a)T^(00)_(,0)+T^(0j)_(,j)=del rho//del t+grad*(rho v)=0:}\begin{equation*} T^{00}{ }_{, 0}+T^{0 j}{ }_{, j}=\partial \rho / \partial t+\boldsymbol{\nabla} \cdot(\rho \boldsymbol{v})=0 \tag{5.38a} \end{equation*}(5.38a)T00,0+T0j,j=ρ/t+(ρv)=0
("equation of continuity");
and
T , 0 j 0 + T j k , k = ( ρ v j ) / t + ( ρ v j v k ) / x k + p / x j = 0 T , 0 j 0 + T j k , k = ρ v j / t + ρ v j v k / x k + p / x j = 0 T_(,0)^(j0)+T^(jk)_(,k)=del(rhov^(j))//del t+del(rhov^(j)v^(k))//delx^(k)+del p//delx^(j)=0T_{, 0}^{j 0}+T^{j k}{ }_{, k}=\partial\left(\rho v^{j}\right) / \partial t+\partial\left(\rho v^{j} v^{k}\right) / \partial x^{k}+\partial p / \partial x^{j}=0T,0j0+Tjk,k=(ρvj)/t+(ρvjvk)/xk+p/xj=0
or, equivalently (by combining with the equation of continuity),
(5.38b) v t + ( v ) v = 1 ρ p ( "Euler's equation"). (5.38b) v t + ( v ) v = 1 ρ p (  "Euler's equation").  {:(5.38b)(del v)/(del t)+(v*grad)v=-(1)/(rho)grad p quad(" "Euler's equation"). ":}\begin{equation*} \frac{\partial v}{\partial t}+(v \cdot \nabla) v=-\frac{1}{\rho} \nabla p \quad(\text { "Euler's equation"). } \tag{5.38b} \end{equation*}(5.38b)vt+(v)v=1ρp( "Euler's equation"). 
Box 5.5 derives and discusses these results from the Newtonian viewpoint.
As a second application of T = 0 T = 0 grad*T=0\boldsymbol{\nabla} \cdot \boldsymbol{T}=0T=0, consider a composite system: a block of rubber with electrically charged beads imbedded in it, interacting with an electromagnetic field. The block of rubber vibrates, and its accelerating beads radiate electromagnetic waves; at the same time, incoming electromagnetic waves push on the beads, altering the pattern of vibration of the block of rubber. The interactions shove 4-momentum back and forth between beaded block and electromagnetic field.

Box 5.5 NEWTONIAN HYDRODYNAMICS REVIEWED

Consider a classical, nonrelativistic, perfect fluid. Apply Newton's law F = m a F = m a F=ma\boldsymbol{F}=m \boldsymbol{a}F=ma to a "fluid particle"; that is, to a small fixed mass of fluid followed in its progress through space:
d d t ( d d t ( (d)/(dt)(\frac{d}{d t}(ddt( momentum per unit mass ) = ( ) = ( )=()=()=( force per unit mass ) ) )))
= ( force per unit volume ) ( density ) = ( gradient of pressure ) ( density ) = (  force per unit volume  ) (  density  ) = (  gradient of pressure  ) (  density  ) =((" force per unit volume "))/((" density "))=(-(" gradient of pressure "))/((" density "))=\frac{(\text { force per unit volume })}{(\text { density })}=\frac{-(\text { gradient of pressure })}{(\text { density })}=( force per unit volume )( density )=( gradient of pressure )( density )
or
(1) d v d t = 1 ρ p (1) d v d t = 1 ρ p {:(1)(dv)/(dt)=-(1)/(rho)grad p:}\begin{equation*} \frac{d v}{d t}=-\frac{1}{\rho} \nabla p \tag{1} \end{equation*}(1)dvdt=1ρp
Translate from time-rate of change following the fluid to time-rate of change as measured at a fixed location, finding
( rate of change with time following fluid ) = ( rate of change with time at fixed location ) + ( velocity of fluid ) ( rate of change with position )  rate of change   with time   following fluid  =  rate of change   with time at   fixed location  + (  velocity   of fluid  ) (  rate of change   with position  ) ([" rate of change "],[" with time "],[" following fluid "])=([" rate of change "],[" with time at "],[" fixed location "])+((" velocity ")/(" of fluid "))*((" rate of change ")/(" with position "))\left(\begin{array}{c} \text { rate of change } \\ \text { with time } \\ \text { following fluid } \end{array}\right)=\left(\begin{array}{c} \text { rate of change } \\ \text { with time at } \\ \text { fixed location } \end{array}\right)+\binom{\text { velocity }}{\text { of fluid }} \cdot\binom{\text { rate of change }}{\text { with position }}( rate of change  with time  following fluid )=( rate of change  with time at  fixed location )+( velocity  of fluid )( rate of change  with position )
or
(2) v t + ( v ) v = 1 ρ p (2) v t + ( v ) v = 1 ρ p {:(2)(del v)/(del t)+(v*grad)v=-(1)/(rho)grad p:}\begin{equation*} \frac{\partial v}{\partial t}+(v \cdot \nabla) v=-\frac{1}{\rho} \nabla p \tag{2} \end{equation*}(2)vt+(v)v=1ρp
or
v i t + v i , k v k = 1 ρ p , i . v i t + v i , k v k = 1 ρ p , i . (delv_(i))/(del t)+v_(i,k)v_(k)=-(1)/(rho)p_(,i).\frac{\partial v_{i}}{\partial t}+v_{i, k} v_{k}=-\frac{1}{\rho} p_{, i} .vit+vi,kvk=1ρp,i.
(Latin indices run from 1 to 3; summation convention; upper and lower indices used indifferently for space dimensions in flat space!) This is Euler's fundamental equation for the hydrodynamics of a perfect fluid.
Two further equations are needed to complete the description of a perfect fluid. One states the absence of heat transfer by requiring that the specific entropy (entropy per unit mass) be constant for each fluid "particle":
(3) d s d t = 0 , or s t + ( v ) s = 0 . (3) d s d t = 0 ,  or  s t + ( v ) s = 0 . {:(3)(ds)/(dt)=0","quad" or "quad(del s)/(del t)+(v*grad)s=0.:}\begin{equation*} \frac{d s}{d t}=0, \quad \text { or } \quad \frac{\partial s}{\partial t}+(\boldsymbol{v} \cdot \nabla) s=0 . \tag{3} \end{equation*}(3)dsdt=0, or st+(v)s=0.
The final equation expresses the conservation of mass:
(4) ρ t + ( ρ v ) = 0 (4) ρ t + ( ρ v ) = 0 {:(4)(del rho)/(del t)+grad*(rho v)=0:}\begin{equation*} \frac{\partial \rho}{\partial t}+\nabla \cdot(\rho \boldsymbol{v})=0 \tag{4} \end{equation*}(4)ρt+(ρv)=0
or
ρ t + ( ρ v k ) , k = 0 ρ t + ρ v k , k = 0 (del rho)/(del t)+(rhov_(k))_(,k)=0\frac{\partial \rho}{\partial t}+\left(\rho v_{k}\right)_{, k}=0ρt+(ρvk),k=0

Box 5.5 (continued)

it is analagous in every way to the equation that expresses conservation of charge in electrodynamics and that bears the same name, "equation of continuity."
The Newtonian stress-energy tensor, like its relativistic counterpart, is linked to conservation of momentum and mass. Therefore examine the time-rate of change of the density of fluid momentum, ρ v i ρ v i rhov_(i)\rho v_{i}ρvi, contained in a unit volume; thus,
(5) ( ρ v i ) / t = ( ρ v i v k ) k p , i . (5) ρ v i / t = ρ v i v k k p , i . {:(5)del(rhov_(i))//del t=-(rhov_(i)v_(k))_(k)-p_(,i).:}\begin{equation*} \partial\left(\rho v_{i}\right) / \partial t=-\left(\rho v_{i} v_{k}\right)_{k}-p_{, i} . \tag{5} \end{equation*}(5)(ρvi)/t=(ρvivk)kp,i.
Momentum flows into the little volume element on the left ("force equals time-rate of change of momentum") and out on the right; similarly at the other faces. Therefore the righthand side of (5) must represent the divergence of this momentum flux:
(6) ( ρ v i ) / t = T i k , k (6) ρ v i / t = T i k , k {:(6)del(rhov_(i))//del t=-T_(ik,k):}\begin{equation*} \partial\left(\rho v_{i}\right) / \partial t=-T_{i k, k} \tag{6} \end{equation*}(6)(ρvi)/t=Tik,k
Consequently, we take for the momentum flux itself
(7) T i k = T i k = "convection" ρ v i v k "push" + δ i k p . (7) T i k = T i k = "convection"  ρ v i v k  "push"  + δ i k p . {:(7)T^(ik)=T_(ik)=ubrace(ubrace)_(""convection" ")(rhov_(i)v_(k))/(" "push" ")+ubrace(ubrace)_(delta_(ik)p).:}\begin{equation*} T^{i k}=T_{i k}=\underbrace{}_{\text {"convection" }} \frac{\rho v_{i} v_{k}}{\text { "push" }}+\underbrace{}_{\delta_{i k} p} . \tag{7} \end{equation*}(7)Tik=Tik="convection" ρvivk "push" +δikp.
For the momentum density, the Newtonian value is
(8) T 0 i = T i 0 = ρ v i (8) T 0 i = T i 0 = ρ v i {:(8)T^(0i)=T^(i0)=rhov_(i):}\begin{equation*} T^{0 i}=T^{i 0}=\rho v_{i} \tag{8} \end{equation*}(8)T0i=Ti0=ρvi
With this notation, the equation for the time-rate of change of momentum becomes
(9) T i μ / x μ = 0 ; (9) T i μ / x μ = 0 ; {:(9)delT^(i mu)//delx^(mu)=0;:}\begin{equation*} \partial T^{i \mu} / \partial x^{\mu}=0 ; \tag{9} \end{equation*}(9)Tiμ/xμ=0;
and with T 00 = ρ T 00 = ρ T^(00)=rhoT^{00}=\rhoT00=ρ, the equation of continuity reads
(10) T 0 μ / x μ = 0 . (10) T 0 μ / x μ = 0 . {:(10)delT^(0mu)//delx^(mu)=0.:}\begin{equation*} \partial T^{0 \mu} / \partial x^{\mu}=0 . \tag{10} \end{equation*}(10)T0μ/xμ=0.
In conclusion, these Newtonian considerations give a reasonable approximation to the relativistic stress-energy tensor:
The 4-momentum of neither block nor field is conserved; neither T (block) T (block)  grad*T_((block) )\boldsymbol{\nabla} \cdot \boldsymbol{T}_{\text {(block) }}T(block)  nor T (em field) T (em field)  grad*T_((em field) )\boldsymbol{\nabla} \cdot \boldsymbol{T}_{\text {(em field) }}T(em field)  vanishes. But total 4-momentum must be conserved, so
(5.39) ( T (block) + T (em field ) ) must vanish. (5.39) T (block)  + T (em field  )  must vanish.  {:(5.39)grad*(T_((block) )+T_((em field )))" must vanish. ":}\begin{equation*} \boldsymbol{\nabla} \cdot\left(\boldsymbol{T}_{\text {(block) }}+\boldsymbol{T}_{\text {(em field })}\right) \text { must vanish. } \tag{5.39} \end{equation*}(5.39)(T(block) +T(em field )) must vanish. 
For a general electromagnetic field interacting with any source, T ( em field ) T ( em field  ) grad*T_(("em field "))\boldsymbol{\nabla} \cdot \boldsymbol{T}_{(\text {em field })}T(em field ) has the form
(5.40) T lem field ) , ν μ ν = F μ α J α . (5.40) T lem field  ) , ν μ ν = F μ α J α . {:(5.40)T_("lem field "),nu)^(mu nu)=-F^(mu alpha)J_(alpha).:}\begin{equation*} T_{\text {lem field }), \nu}^{\mu \nu}=-F^{\mu \alpha} J_{\alpha} . \tag{5.40} \end{equation*}(5.40)Tlem field ),νμν=FμαJα.
(This was derived in exercise 3.18 by combining T μ ν , ν = 0 T μ ν , ν = 0 T^(mu nu)_(,nu)=0T^{\mu \nu}{ }_{, \nu}=0Tμν,ν=0 with expression 5.22 for the electromagnetic stress-energy tensor, and with Maxwell's equations.) For our beaded block, J J J\boldsymbol{J}J is the 4 -current associated with the vibrating, charged beads, and F F F\boldsymbol{F}F is the electromagnetic field tensor. The time component of equation (5.40) reads
(5.41) T (em field , ν 0 ν = F 0 k J k = E J = ( rate at which electric field E does work on a unit volume of charged beads ) . (5.41) T (em field  , ν 0 ν = F 0 k J k = E J = (  rate at which electric field  E  does work   on a unit volume of charged beads  ) . {:[(5.41)T_((em field ,nu)^(0nu)=-F^(0k)J_(k)=-E*J],[=-((" rate at which electric field "E" does work ")/(" on a unit volume of charged beads ")).]:}\begin{align*} T_{\text {(em field }, \nu}^{0 \nu} & =-F^{0 k} J_{k}=-\boldsymbol{E} \cdot \boldsymbol{J} \tag{5.41}\\ & =-\binom{\text { rate at which electric field } \boldsymbol{E} \text { does work }}{\text { on a unit volume of charged beads }} . \end{align*}(5.41)T(em field ,ν0ν=F0kJk=EJ=( rate at which electric field E does work  on a unit volume of charged beads ).
For comparison, T (block), 0 00 T (block),  0 00 T_((block), 0)^(00)T_{\text {(block), } 0}^{00}T(block), 000 is the rate at which the block's energy density changes with time, T (block), j is 0 j T (block),  j  is  0 j -T_((block), j" is ")^(0j)-T_{\text {(block), } j \text { is }}^{0 j}T(block), j is 0j ithe contribution of the block's energy flux to this rate of change of energy density, and consequently their difference T (block), , 0 ν T (block),  , 0 ν T_((block), ,)^(0nu)T_{\text {(block), },}^{0 \nu}T(block), ,0ν, has the meaning
(5.42) T (block) , v 0 v = ( rate at which mass-energy of block per unit volume increases due to actions other than internal mechanical forces between one part of block and another ) (5.42) T (block)  , v 0 v =  rate at which mass-energy of block per   unit volume increases due to actions   other than internal mechanical forces   between one part of block and another  {:(5.42)T_((block) ,v)^(0v)=([" rate at which mass-energy of block per "],[" unit volume increases due to actions "],[" other than internal mechanical forces "],[" between one part of block and another "]):}T_{\text {(block) }, v}^{0 v}=\left(\begin{array}{l} \text { rate at which mass-energy of block per } \tag{5.42}\\ \text { unit volume increases due to actions } \\ \text { other than internal mechanical forces } \\ \text { between one part of block and another } \end{array}\right)(5.42)T(block) ,v0v=( rate at which mass-energy of block per  unit volume increases due to actions  other than internal mechanical forces  between one part of block and another )
Hence, the conservation law
( T (em field) 0 ν + T (block) 0 ν ) , ν = 0 T (em field)  0 ν + T (block)  0 ν , ν = 0 (T_((em field) )^(0nu)+T_((block) )^(0nu))_(,nu)=0\left(T_{\text {(em field) }}^{0 \nu}+T_{\text {(block) }}^{0 \nu}\right)_{, \nu}=0(T(em field) 0ν+T(block) 0ν),ν=0
says that the mass-energy of the block increases at precisely the same rate as the electric field does work on the beads. A similar result holds for momentum:
(5.44) T (em field) , ν k ν e k = F k ν J ν e k = ( J 0 E + J × B ) = ( Lorentz force per unit volume acting on beads ) , T (block), , k ν e k = ( rate at which momentum per unit volume of block increases due to actions other than its own stresses ) ; (5.44) T (em field)  , ν k ν e k = F k ν J ν e k = J 0 E + J × B = (  Lorentz force per unit volume   acting on beads  ) , T (block),  , k ν e k =  rate at which momentum per unit volume   of block increases due to actions   other than its own stresses  ; {:(5.44){:[T_((em field) ,nu)^(k nu)e_(k),=-F^(k nu)J_(nu)e_(k)=-(J^(0)E+J xx B)],[,=-((" Lorentz force per unit volume ")/(" acting on beads "))","],[T_((block), ,)^(k nu)e_(k),=([" rate at which momentum per unit volume "],[" of block increases due to actions "],[" other than its own stresses "]:}]);:}\left.\begin{array}{rl} T_{\text {(em field) }, \nu}^{k \nu} \boldsymbol{e}_{k} & =-F^{k \nu} J_{\nu} \boldsymbol{e}_{k}=-\left(J^{0} \boldsymbol{E}+\boldsymbol{J} \times \boldsymbol{B}\right) \\ & =-\binom{\text { Lorentz force per unit volume }}{\text { acting on beads }}, \\ T_{\text {(block), },}^{k \nu} \boldsymbol{e}_{k} & =\left(\begin{array}{l} \text { rate at which momentum per unit volume } \\ \text { of block increases due to actions } \\ \text { other than its own stresses } \end{array}\right. \tag{5.44} \end{array}\right) ;(5.44)T(em field) ,νkνek=FkνJνek=(J0E+J×B)=( Lorentz force per unit volume  acting on beads ),T(block), ,kνek=( rate at which momentum per unit volume  of block increases due to actions  other than its own stresses );
so the conservation law
( T (em field) k ν + T (block k ν ) , p = 0 T (em field)  k ν + T (block  k ν , p = 0 (T_((em field) )^(k nu)+T_((block )^(k nu))_(,p)=0\left(T_{\text {(em field) }}^{k \nu}+T_{\text {(block }}^{k \nu}\right)_{, p}=0(T(em field) kν+T(block kν),p=0
says that the rate of change of the momentum of the block equals the force of the electromagnetic field on its beads.
Angular momentum defined and its integral conservation law derived

§5.11. ANGULAR MOMENTUM

The symmetry, T μ ν = T ν μ T μ ν = T ν μ T^(mu nu)=T^(nu mu)T^{\mu \nu}=T^{\nu \mu}Tμν=Tνμ, of the stress-energy tensor enables one to define a conserved angular momentum J α β J α β J^(alpha beta)J^{\alpha \beta}Jαβ, analogous to the linear momentum p α p α p^(alpha)p^{\alpha}pα. The angular momentum is defined relative to a specific but arbitrary origin-an event a a aaa with coordinates, in a particular Lorentz frame,
(5.45) x α ( a ) = a α . (5.45) x α ( a ) = a α . {:(5.45)x^(alpha)(a)=a^(alpha).:}\begin{equation*} x^{\alpha}(a)=a^{\alpha} . \tag{5.45} \end{equation*}(5.45)xα(a)=aα.
The angular momentum about a a aaa is defined using the tensor
(5.46) g α β γ = ( x α a α ) T β γ ( x β a β ) T α γ . (5.46) g α β γ = x α a α T β γ x β a β T α γ . {:(5.46)g^(alpha beta gamma)=(x^(alpha)-a^(alpha))T^(beta gamma)-(x^(beta)-a^(beta))T^(alpha gamma).:}\begin{equation*} \mathscr{g}^{\alpha \beta \gamma}=\left(x^{\alpha}-a^{\alpha}\right) T^{\beta \gamma}-\left(x^{\beta}-a^{\beta}\right) T^{\alpha \gamma} . \tag{5.46} \end{equation*}(5.46)gαβγ=(xαaα)Tβγ(xβaβ)Tαγ.
(Note that x α a α x α a α x^(alpha)-a^(alpha)x^{\alpha}-a^{\alpha}xαaα is the vector separation of the "field point" x α x α x^(alpha)x^{\alpha}xα from the "origin" A ; T α γ A ; T α γ A;T^(alpha gamma)\mathfrak{A} ; T^{\alpha \gamma}A;Tαγ is here evaluated at the "field point".) Because of the symmetry of T , g α β γ T , g α β γ T,g^(alpha beta gamma)\boldsymbol{T}, g^{\alpha \beta \gamma}T,gαβγ has vanishing divergence:
g α β γ , γ = δ α γ T β γ + ( x α a α ) T β γ γ , 0 δ β γ T α γ ( x β a β ) T α γ , γ 0 (5.47) = T β α T α β = 0 . g α β γ , γ = δ α γ T β γ + x α a α T β γ γ , 0 δ β γ T α γ x β a β T α γ , γ 0 (5.47) = T β α T α β = 0 . {:[g^(alpha beta gamma)_(,gamma)=delta^(alpha)_(gamma)T^(beta gamma)+(x^(alpha)-a^(alpha))ubrace(T^(beta gamma)_(gamma),ubrace)_(0)-delta^(beta)_(gamma)T^(alpha gamma)-(x^(beta)-a^(beta))ubrace(T^(alpha gamma),gammaubrace)_(0)],[(5.47)=T^(beta alpha)-T^(alpha beta)=0.]:}\begin{align*} \mathcal{g}^{\alpha \beta \gamma}{ }_{, \gamma} & =\delta^{\alpha}{ }_{\gamma} T^{\beta \gamma}+\left(x^{\alpha}-a^{\alpha}\right) \underbrace{T^{\beta \gamma}{ }_{\gamma},}_{0}-\delta^{\beta}{ }_{\gamma} T^{\alpha \gamma}-\left(x^{\beta}-a^{\beta}\right) \underbrace{T^{\alpha \gamma}, \gamma}_{0} \\ & =T^{\beta \alpha}-T^{\alpha \beta}=0 . \tag{5.47} \end{align*}gαβγ,γ=δαγTβγ+(xαaα)Tβγγ,0δβγTαγ(xβaβ)Tαγ,γ0(5.47)=TβαTαβ=0.
Consequently, its integral over any closed 3 -surface vanishes
(5.48) γ g α β γ d 3 Σ γ = 0 (5.48) γ g α β γ d 3 Σ γ = 0 {:(5.48)oint_(del gamma)g^(alpha beta gamma)d^(3)Sigma_(gamma)=0:}\begin{equation*} \oint_{\partial \gamma} g^{\alpha \beta \gamma} d^{3} \Sigma_{\gamma}=0 \tag{5.48} \end{equation*}(5.48)γgαβγd3Σγ=0
("integral form of the law of conservation of angular momentum").
The integral over a spacelike surface of constant time t t ttt is
(5.49) J α β = g α β 0 d x d y d z = [ ( x α a α ) T β 0 ( x β a β ) T α 0 ] d x d y d z (5.49) J α β = g α β 0 d x d y d z = x α a α T β 0 x β a β T α 0 d x d y d z {:(5.49)J^(alpha beta)=intg^(alpha beta0)dxdydz=int[(x^(alpha)-a^(alpha))T^(beta0)-(x^(beta)-a^(beta))T^(alpha0)]dxdydz:}\begin{equation*} J^{\alpha \beta}=\int \mathscr{g}^{\alpha \beta 0} d x d y d z=\int\left[\left(x^{\alpha}-a^{\alpha}\right) T^{\beta 0}-\left(x^{\beta}-a^{\beta}\right) T^{\alpha 0}\right] d x d y d z \tag{5.49} \end{equation*}(5.49)Jαβ=gαβ0dxdydz=[(xαaα)Tβ0(xβaβ)Tα0]dxdydz
Recalling that T β 0 T β 0 T^(beta0)T^{\beta 0}Tβ0 is momentum density, one sees that (5.49) has the same form as the equation " J = r × p J = r × p J=r xx p\boldsymbol{J}=\boldsymbol{r} \times \boldsymbol{p}J=r×p " of Newtonian theory. Hence the name "total angular momentum" for J α β J α β J^(alpha beta)J^{\alpha \beta}Jαβ. Various aspects of this conserved angular momentum, including the tie to its Newtonian cousin, are explored in Box 5.6.

EXERCISES

Exercise 5.2. CHARGE CONSERVATION

Exercise 3.16 revealed that the charge-current 4-vector J J J\boldsymbol{J}J satisfies the differential conservation law J = 0 J = 0 grad*J=0\boldsymbol{\nabla} \cdot \boldsymbol{J}=0J=0. Write down the corresponding integral conservation law, and interpret it for the four closed surfaces of Fig. 5.3.

Exercise 5.3. PARTICLE PRODUCTION

Inside highly evolved, massive stars, the temperature is so high that electron-positron pairs are continually produced and destroyed. Let S S S\boldsymbol{S}S be the number-flux vector for electrons and positrons, and denote its divergence by
(5.50) ϵ S (5.50) ϵ S {:(5.50)epsilon-=grad*S:}\begin{equation*} \epsilon \equiv \nabla \cdot S \tag{5.50} \end{equation*}(5.50)ϵS

Box 5.6 ANGULAR MOMENTUM

A. Definition of Angular Momentum

(a) Pick an arbitrary spacelike hypersurface S S SSS and an arbitrary event a a aaa with coordinates x α ( A ) a α x α ( A ) a α x^(alpha)(A)-=a^(alpha)x^{\alpha}(\mathscr{A}) \equiv a^{\alpha}xα(A)aα. (Use globally inertial coordinates throughout.)
(b) Define "total angular momentum on S S SSS about a a aaa " to be
J μ ν S g μ ν α d 3 Σ α g μ ν α ( x μ a μ ) T ν α ( x ν a ν ) T μ α J μ ν S g μ ν α d 3 Σ α g μ ν α x μ a μ T ν α x ν a ν T μ α {:[J^(mu nu)-=int_(S)g^(mu nu alpha)d^(3)Sigma_(alpha)],[g^(mu nu alpha)-=(x^(mu)-a^(mu))T^(nu alpha)-(x^(nu)-a^(nu))T^(mu alpha)]:}\begin{aligned} J^{\mu \nu} & \equiv \int_{S} g^{\mu \nu \alpha} d^{3} \Sigma_{\alpha} \\ g^{\mu \nu \alpha} & \equiv\left(x^{\mu}-a^{\mu}\right) T^{\nu \alpha}-\left(x^{\nu}-a^{\nu}\right) T^{\mu \alpha} \end{aligned}JμνSgμναd3Σαgμνα(xμaμ)Tνα(xνaν)Tμα

(c) If S S SSS is a hypersurface of constant time t t ttt, this becomes
J μ ν = g μ ν 0 d x d y d z J μ ν = g μ ν 0 d x d y d z J^(mu nu)=intg^(mu nu0)dxdydzJ^{\mu \nu}=\int g^{\mu \nu 0} d x d y d zJμν=gμν0dxdydz

B. Conservation of Angular Momentum

(a) T μ ν , ν = 0 T μ ν , ν = 0 T^(mu nu)_(,nu)=0T^{\mu \nu}{ }_{, \nu}=0Tμν,ν=0 implies g μ ν α , α = 0 g μ ν α , α = 0 g^(mu nu alpha)_(,alpha)=0\mathscr{g}^{\mu \nu \alpha}{ }_{, \alpha}=0gμνα,α=0.
(b) This means that J μ ν J μ ν J^(mu nu)J^{\mu \nu}Jμν is independent of the hypersurface S S SSS on which it is calculated (Gauss's theorem):
J μ ν ( S A ) J μ ν ( S B ) = V g μ ν α d 3 Σ α = V G μ ν α , α d 4 x = 0 . J μ ν S A J μ ν S B = V g μ ν α d 3 Σ α = V G μ ν α , α d 4 x = 0 . {:[J^(mu nu)(S_(A))-J^(mu nu)(S_(B))],[quad=int_(del V)g^(mu nu alpha)d^(3)Sigma_(alpha)],[quad=int_(V)G^(mu nu alpha)_(,alpha)d^(4)x=0.]:}\begin{aligned} & J^{\mu \nu}\left(S_{A}\right)-J^{\mu \nu}\left(S_{B}\right) \\ & \quad=\int_{\partial V} \mathscr{g}^{\mu \nu \alpha} d^{3} \Sigma_{\alpha} \\ & \quad=\int_{\mathscr{V}} \mathscr{G}^{\mu \nu \alpha}{ }_{, \alpha} d^{4} x=0 . \end{aligned}Jμν(SA)Jμν(SB)=Vgμναd3Σα=VGμνα,αd4x=0.

(Note: V V delV-=\partial \mathscr{V} \equivV (boundary of V V V\mathscr{V}V ) includes S A , S B S A , S B S_(A),S_(B)S_{A}, S_{B}SA,SB, and timelike surfaces at spatial infinity; contribution of latter dropped-localized source.)

C. Change of Point About Which Angular Momentum is Calculated

Let b α b α b^(alpha)b^{\alpha}bα be vector from a 0 a 0 a_(0)a_{0}a0 to a 1 : b α = a 1 α a 0 α a 1 : b α = a 1 α a 0 α a_(1):b^(alpha)=a_(1)^(alpha)-a_(0)^(alpha)a_{1}: b^{\alpha}=a_{1}{ }^{\alpha}-a_{0}{ }^{\alpha}a1:bα=a1αa0α. Then
J μ ν ( about a 1 ) J μ ν ( about a 0 ) = b μ S T ν α d 3 Σ α + b ν S T μ α d 3 Σ α = b μ P ν + b ν P μ J μ ν  about  a 1 J μ ν  about  a 0 = b μ S T ν α d 3 Σ α + b ν S T μ α d 3 Σ α = b μ P ν + b ν P μ {:[J^(mu nu)(" about "a_(1))-J^(mu nu)(" about "a_(0))],[quad=-b^(mu)int_(S)T^(nu alpha)d^(3)Sigma_(alpha)+b^(nu)int_(S)T^(mu alpha)d^(3)Sigma_(alpha)],[quad=-b^(mu)P^(nu)+b^(nu)P^(mu)]:}\begin{aligned} & J^{\mu \nu}\left(\text { about } \mathscr{a}_{1}\right)-J^{\mu \nu}\left(\text { about } \mathscr{a}_{0}\right) \\ & \quad=-b^{\mu} \int_{S} T^{\nu \alpha} d^{3} \Sigma_{\alpha}+b^{\nu} \int_{S} T^{\mu \alpha} d^{3} \Sigma_{\alpha} \\ & \quad=-b^{\mu} P^{\nu}+b^{\nu} P^{\mu} \end{aligned}Jμν( about a1)Jμν( about a0)=bμSTναd3Σα+bνSTμαd3Σα=bμPν+bνPμ

where P μ P μ P^(mu)P^{\mu}Pμ is total 4-momentum.
Box 5.6 (continued)
D. Intrinsic Angular Momentum
(a) Work, for a moment, in the system's rest frame, where P 0 = M , P j = 0 , x C M j = 1 M x j T 00 d 3 x = P 0 = M , P j = 0 , x C M j = 1 M x j T 00 d 3 x = P^(0)=M,quadP^(j)=0,quadx_(CM)^(j)=(1)/(M)intx^(j)T^(00)d^(3)x=P^{0}=M, \quad P^{j}=0, \quad x_{C M}{ }^{j}=\frac{1}{M} \int x^{j} T^{00} d^{3} x=P0=M,Pj=0,xCMj=1MxjT00d3x= location of center of mass.
Intrinsic angular momentum is defined as angular momentum about any event ( a 0 , x C M j a 0 , x C M j a^(0),x_(CM)^(j)a^{0}, x_{C M}{ }^{j}a0,xCMj ) on center of mass's world line. Its components are denoted S μ ν S μ ν S^(mu nu)S^{\mu \nu}Sμν and work out to be
S 0 j = 0 , S j k = ϵ j k S , S 0 j = 0 , S j k = ϵ j k S , S^(0j)=0,S^(jk)=epsilon^(jkℓ)S^(ℓ),S^{0 j}=0, S^{j k}=\epsilon^{j k \ell} S^{\ell},S0j=0,Sjk=ϵjkS,
where
S ( x x C M ) × ( momentum density ) d 3 x "intrinsic angular momentum vector." S x x C M × (  momentum density  ) d 3 x  "intrinsic angular momentum vector."  {:[S-=int(x-x_(CM))xx(" momentum density ")d^(3)x],[-=" "intrinsic angular momentum vector." "]:}\begin{aligned} \boldsymbol{S} & \equiv \int\left(\boldsymbol{x}-\boldsymbol{x}_{C M}\right) \times(\text { momentum density }) d^{3} x \\ & \equiv \text { "intrinsic angular momentum vector." } \end{aligned}S(xxCM)×( momentum density )d3x "intrinsic angular momentum vector." 
(b) Define "intrinsic angular momentum 4 -vector" S μ S μ S^(mu)S^{\mu}Sμ to be that 4 -vector whose components in the rest frame are ( 0 , S ) ( 0 , S ) (0,S)(0, \boldsymbol{S})(0,S); then the above equations say
S μ ν = U α S β ϵ α β μ ν , U β P β / M = 4 -velocity of center of mass , U β S β = 0 . S μ ν = U α S β ϵ α β μ ν , U β P β / M = 4 -velocity of center of mass  , U β S β = 0 . {:[S^(mu nu)=U_(alpha)S_(beta)epsilon^(alpha beta mu nu)","],[U_(beta)-=P_(beta)//M=4"-velocity of center of mass "","],[U_(beta)S^(beta)=0.]:}\begin{aligned} S^{\mu \nu} & =U_{\alpha} S_{\beta} \epsilon^{\alpha \beta \mu \nu}, \\ U_{\beta} & \equiv P_{\beta} / M=4 \text {-velocity of center of mass }, \\ U_{\beta} S^{\beta} & =0 . \end{aligned}Sμν=UαSβϵαβμν,UβPβ/M=4-velocity of center of mass ,UβSβ=0.

E. Decomposition of Angular Momentum into Intrinsic and Orbital Parts

(a) Pick an arbitrary event a a aaa, whose perpendicular displacement from center-of-mass world line is Y α Y α -Y^(alpha)-Y^{\alpha}Yα, so
U β Y β = 0 . U β Y β = 0 . U_(beta)Y^(beta)=0.U_{\beta} Y^{\beta}=0 .UβYβ=0.

(b) Then, by Part C, the angular momentum about a a aaa is
J μ ν = U α S β ϵ α β μ ν S μ ν ( intrinsic ) + Y μ P ν Y ν P μ L μ ν ( orbital ) J μ ν = U α S β ϵ α β μ ν S μ ν (  intrinsic  ) + Y μ P ν Y ν P μ L μ ν (  orbital  ) J^(mu nu)=ubrace(U_(alpha)S_(beta)epsilon^(alpha beta mu nu)ubrace)_(S^(mu nu)(" intrinsic "))+ubrace(Y^(mu)P^(nu)-Y^(nu)P^(mu)ubrace)_(L^(mu nu)(" orbital "))J^{\mu \nu}=\underbrace{U_{\alpha} S_{\beta} \epsilon^{\alpha \beta \mu \nu}}_{S^{\mu \nu}(\text { intrinsic })}+\underbrace{Y^{\mu} P^{\nu}-Y^{\nu} P^{\mu}}_{L^{\mu \nu}(\text { orbital })}Jμν=UαSβϵαβμνSμν( intrinsic )+YμPνYνPμLμν( orbital )
(c) Knowing the angular momentum about a a a\mathfrak{a}a, and the 4 -momentum (and hence 4 -velocity), one can calculate the vector from a a aaa to the center-of-mass world line,
Y μ = J μ ν P ν / M 2 , Y μ = J μ ν P ν / M 2 , Y^(mu)=-J^(mu nu)P_(nu)//M^(2),Y^{\mu}=-J^{\mu \nu} P_{\nu} / M^{2},Yμ=JμνPν/M2,
and the intrinsic angular momentum
S ρ = 1 2 U σ J μ ν ϵ σ μ ν ρ . S ρ = 1 2 U σ J μ ν ϵ σ μ ν ρ . S_(rho)=(1)/(2)U^(sigma)J^(mu nu)epsilon_(sigma mu nu rho).S_{\rho}=\frac{1}{2} U^{\sigma} J^{\mu \nu} \epsilon_{\sigma \mu \nu \rho} .Sρ=12UσJμνϵσμνρ.
Use Gauss's theorem to show that ϵ ϵ epsilon\epsilonϵ is the number of particles created (minus the number destroyed) in a unit four-dimensional volume of spacetime.

Exercise 5.4. INERTIAL MASS PER UNIT VOLUME

Consider a stressed medium in motion with ordinary velocity | v | 1 | v | 1 |v|≪1|\boldsymbol{v}| \ll 1|v|1 with respect to a specific Lorentz frame.
(a) Show by Lorentz transformations that the spatial components of the momentum density are
(5.51) T 0 j = k m j k v k (5.51) T 0 j = k m j k v k {:(5.51)T^(0j)=sum_(k)m^(jk)v^(k):}\begin{equation*} T^{0 j}=\sum_{k} m^{j k} v^{k} \tag{5.51} \end{equation*}(5.51)T0j=kmjkvk
where
(5.52) m j k = T 0 0 δ j k + T j k (5.52) m j k = T 0 ¯ 0 ¯ δ j k + T j k ¯ {:(5.52)m^(jk)=T^( bar(0) bar(0))delta^(jk)+T^( bar(jk)):}\begin{equation*} m^{j k}=T^{\overline{0} \overline{0}} \delta^{j k}+T^{\overline{j k}} \tag{5.52} \end{equation*}(5.52)mjk=T00δjk+Tjk
and T μ ¯ ν ¯ T μ ¯ ν ¯ T^( bar(mu) bar(nu))T^{\bar{\mu} \bar{\nu}}Tμ¯ν¯ are the components of the stress-energy tensor in the rest frame of the medium. Throughout the solar system T 00 | T 3 k | T 00 ¯ T 3 k T^( bar(00))≫|T^(3k)|T^{\overline{00}} \gg\left|T^{3 k}\right|T00|T3k| (see, e.g., discussion of hurricane in § 5.10 § 5.10 §5.10\S 5.10§5.10 ), so one is accustomed to write T 0 j = T 00 v j T 0 j = T 00 ¯ v j T^(0j)=T^( bar(00))v^(j)T^{0 j}=T^{\overline{00}} v^{j}T0j=T00vj, i.e., " ( ( ((( momentum density ) = ( ) = ( )=()=()=( rest-mass density ) × ( ) × ( )xx() \times()×( velocity)". But inside a neutron star T 0 0 T 0 ¯ 0 ¯ T^( bar(0) bar(0))T^{\overline{0} \overline{0}}T00 may be of the same order of magnitude as T j k ¯ T j k ¯ T^(j bar(k))T^{j \bar{k}}Tjk¯, so one must replace "(momentum density) = = === (rest-mass density) × × xx\times× (velocity)" by equations (5.51) and (5.52), at low velocities.
(b) Derive equations ( 5.51 ) and (5.52) from Newtonian considerations plus the equivalence of mass and energy. (Hint: the total mass-energy carried past the observer by a volume V V VVV of the medium includes both the rest mass T 00 V T 00 ¯ V T^( bar(00))VT^{\overline{00}} VT00V and the work done by forces acting across the volume's faces as they "push" the volume through a distance.)
(c) As a result of relation (5.51), the force per unit volume required to produce an acceleration d v k / d t d v k / d t dv^(k)//dtd v^{k} / d tdvk/dt in a stressed medium, which is at rest with respect to the man who applies the force, is
(5.53) F j = d T 0 j / d t = k m j k d v k / d t (5.53) F j = d T 0 j / d t = k m j k d v k / d t {:(5.53)F^(j)=dT^(0j)//dt=sum_(k)m^(jk)dv^(k)//dt:}\begin{equation*} F^{j}=d T^{0 j} / d t=\sum_{k} m^{j k} d v^{k} / d t \tag{5.53} \end{equation*}(5.53)Fj=dT0j/dt=kmjkdvk/dt
This equation suggests that one call m j k m j k m^(jk)m^{j k}mjk the "inertial mass per unit volume" of a stressed medium at rest. In general m i k m i k m^(ik)m^{i k}mik is a symmetric 3-tensor. What does it become for the special case of a perfect fluid?
(d) Consider an isolated, stressed body at rest and in equilibrium ( T α β , 0 = 0 T α β , 0 = 0 T^(alpha beta)_(,0)=0T^{\alpha \beta}{ }_{, 0}=0Tαβ,0=0 ) in the laboratory frame. Show that its total inertial mass, defined by
(5.54) M i j = stressed body m i j d x d y d z (5.54) M i j =  stressed   body  m i j d x d y d z {:(5.54)M^(ij)=int_({:[" stressed "],[" body "]:})m^(ij)dxdydz:}\begin{equation*} M^{i j}=\int_{\substack{\text { stressed } \\ \text { body }}} m^{i j} d x d y d z \tag{5.54} \end{equation*}(5.54)Mij= stressed  body mijdxdydz
is isotropic and equals the rest mass of the body
(5.55) M i j = δ i j T n 0 d x d y d z (5.55) M i j = δ i j T n 0 d x d y d z {:(5.55)M^(ij)=delta^(ij)intT^(n0)dxdydz:}\begin{equation*} M^{i j}=\delta^{i j} \int T^{n 0} d x d y d z \tag{5.55} \end{equation*}(5.55)Mij=δijTn0dxdydz

Exercise 5.5. DETERMINANTS AND JACOBIANS

(a) Write out explicitly the sum defining d 2 S 01 d 2 S 01 d^(2)S_(01)d^{2} S_{01}d2S01 in
d 2 S μ ν ϵ μ ν α β x α a x β b d a d b . d 2 S μ ν ϵ μ ν α β x α a x β b d a d b . d^(2)S_(mu nu)-=epsilon_(mu nu alpha beta)(delx^(alpha))/(del a)(delx^(beta))/(del b)dadb.d^{2} S_{\mu \nu} \equiv \epsilon_{\mu \nu \alpha \beta} \frac{\partial x^{\alpha}}{\partial a} \frac{\partial x^{\beta}}{\partial b} d a d b .d2Sμνϵμναβxαaxβbdadb.
Thereby establish the formula
d 2 S μ ν = ϵ μ ν | α β | ( x α , x β ) ( a , b ) d a d b = 1 2 ! ϵ μ ν α β ( x α , x β ) ( a , b ) d a d b . d 2 S μ ν = ϵ μ ν | α β | x α , x β ( a , b ) d a d b = 1 2 ! ϵ μ ν α β x α , x β ( a , b ) d a d b . d^(2)S_(mu nu)=epsilon_(mu nu|alpha beta|)(del(x^(alpha),x^(beta)))/(del(a,b))dadb=(1)/(2!)epsilon_(mu nu alpha beta)(del(x^(alpha),x^(beta)))/(del(a,b))dadb.d^{2} S_{\mu \nu}=\epsilon_{\mu \nu|\alpha \beta|} \frac{\partial\left(x^{\alpha}, x^{\beta}\right)}{\partial(a, b)} d a d b=\frac{1}{2!} \epsilon_{\mu \nu \alpha \beta} \frac{\partial\left(x^{\alpha}, x^{\beta}\right)}{\partial(a, b)} d a d b .d2Sμν=ϵμν|αβ|(xα,xβ)(a,b)dadb=12!ϵμναβ(xα,xβ)(a,b)dadb.
(Expressions such as these should occur only under integral signs. In this exercise one may either supply an int dots\int \ldots wherever necessary, or else interpret the differentials in terms of the exterior calculus, d a d b d a d b d a d b d a d b dadb longrightarrow da^^dbd a d b \longrightarrow \boldsymbol{d} a \wedge \boldsymbol{d} bdadbdadb; see Box 5.4.) The notation used here for Jacobian determinants is
( f , g ) ( a , b ) = | f a f b g a g b | ( f , g ) ( a , b ) = f a f b g a g b (del(f,g))/(del(a,b))=|[(del f)/(del a),(del f)/(del b)],[(del g)/(del a),(del g)/(del b)]|\frac{\partial(f, g)}{\partial(a, b)}=\left|\begin{array}{ll} \frac{\partial f}{\partial a} & \frac{\partial f}{\partial b} \\ \frac{\partial g}{\partial a} & \frac{\partial g}{\partial b} \end{array}\right|(f,g)(a,b)=|fafbgagb|
(b) By a similar inspection of a specific case, show that
d 3 Σ μ ϵ μ α β γ x α a x β b x γ c d a d b d c = 1 3 ! ϵ μ α β γ ( x α , x β , x γ ) ( a , b , c ) d a d b d c d 3 Σ μ ϵ μ α β γ x α a x β b x γ c d a d b d c = 1 3 ! ϵ μ α β γ x α , x β , x γ ( a , b , c ) d a d b d c d^(3)Sigma_(mu)-=epsilon_(mu alpha beta gamma)(delx^(alpha))/(del a)(delx^(beta))/(del b)(delx^(gamma))/(del c)dadbdc=(1)/(3!)epsilon_(mu alpha beta gamma)(del(x^(alpha),x^(beta),x^(gamma)))/(del(a,b,c))dadbdcd^{3} \Sigma_{\mu} \equiv \epsilon_{\mu \alpha \beta \gamma} \frac{\partial x^{\alpha}}{\partial a} \frac{\partial x^{\beta}}{\partial b} \frac{\partial x^{\gamma}}{\partial c} d a d b d c=\frac{1}{3!} \epsilon_{\mu \alpha \beta \gamma} \frac{\partial\left(x^{\alpha}, x^{\beta}, x^{\gamma}\right)}{\partial(a, b, c)} d a d b d cd3Σμϵμαβγxαaxβbxγcdadbdc=13!ϵμαβγ(xα,xβ,xγ)(a,b,c)dadbdc
(c) Cite a precise definition of the value of a determinant as a sum of terms (with suitably alternating signs), with each term a product containing one factor from each row and simultaneously one factor from each column. Show that this definition can be stated (in the 4 × 4 4 × 4 4xx44 \times 44×4 case, with the p × p p × p p xx pp \times pp×p case an obvious extension) as
det A det A λ ρ = ϵ α β γ δ A α 0 A β 1 A γ 2 A 8 3 . det A det A λ ρ = ϵ α β γ δ A α 0 A β 1 A γ 2 A 8 3 . det A-=det||A^(lambda)_(rho)||=epsilon_(alpha beta gamma delta)A^(alpha)_(0)A^(beta)_(1)A^(gamma)_(2)A^(8)_(3).\operatorname{det} A \equiv \operatorname{det}\left\|A^{\lambda}{ }_{\rho}\right\|=\epsilon_{\alpha \beta \gamma \delta} A^{\alpha}{ }_{0} A^{\beta}{ }_{1} A^{\gamma}{ }_{2} A^{8}{ }_{3} .detAdetAλρ=ϵαβγδAα0Aβ1Aγ2A83.
(d) Show that
det A = 1 4 ! δ α β γ δ μ ν ρ σ A α μ A β ν A γ ρ A δ σ det A = 1 4 ! δ α β γ δ μ ν ρ σ A α μ A β ν A γ ρ A δ σ det A=(1)/(4!)delta_(alpha beta gamma delta)^(mu nu rho sigma)A^(alpha)_(mu)A^(beta)_(nu)A^(gamma)_(rho)A^(delta)_(sigma)\operatorname{det} A=\frac{1}{4!} \delta_{\alpha \beta \gamma \delta}^{\mu \nu \rho \sigma} A^{\alpha}{ }_{\mu} A^{\beta}{ }_{\nu} A^{\gamma}{ }_{\rho} A^{\delta}{ }_{\sigma}detA=14!δαβγδμνρσAαμAβνAγρAδσ
(for a definition of δ α β γ δ μ ν ρ σ δ α β γ δ μ ν ρ σ delta_(alpha beta gamma delta)^(mu nu rho sigma)\delta_{\alpha \beta \gamma \delta}^{\mu \nu \rho \sigma}δαβγδμνρσ, see exercises 3.13 and 4.12).
(e) Use properties of the δ δ delta\deltaδ-symbol to show that the matrix A 1 A 1 A^(-1)A^{-1}A1 inverse to A A AAA has entries ( A 1 ) μ α A 1 μ α (A^(-1))^(mu)_(alpha)\left(A^{-1}\right)^{\mu}{ }_{\alpha}(A1)μα given by
( A 1 ) μ α ( det A ) = 1 3 ! δ α β γ δ μ ν ρ σ A β ν A γ ρ A δ σ . A 1 μ α ( det A ) = 1 3 ! δ α β γ δ μ ν ρ σ A β ν A γ ρ A δ σ . (A^(-1))^(mu)_(alpha)(det A)=(1)/(3!)delta_(alpha beta gamma delta)^(mu nu rho sigma)A^(beta)_(nu)A^(gamma)_(rho)A^(delta)_(sigma).\left(A^{-1}\right)^{\mu}{ }_{\alpha}(\operatorname{det} A)=\frac{1}{3!} \delta_{\alpha \beta \gamma \delta}^{\mu \nu \rho \sigma} A^{\beta}{ }_{\nu} A^{\gamma}{ }_{\rho} A^{\delta}{ }_{\sigma} .(A1)μα(detA)=13!δαβγδμνρσAβνAγρAδσ.
(f) By an "index-mechanics" computation, from the formula for det A det A det A\operatorname{det} AdetA in part (d) derive the following expression for the derivative of the logarithm of the determinant
d ln | det A | = trace ( A 1 d A ) d ln | det A | = trace A 1 d A d ln |det A|=trace(A^(-1)dA)\boldsymbol{d} \ln |\operatorname{det} A|=\operatorname{trace}\left(A^{-1} \boldsymbol{d} A\right)dln|detA|=trace(A1dA)
Here d A d A dA\boldsymbol{d} AdA is the matrix d A α μ d A α μ ||dA^(alpha)_(mu)||\left\|\boldsymbol{d} A^{\alpha}{ }_{\mu}\right\|dAαμ whose entries are 1-forms.

Exercise 5.6. CENTROIDS AND SIZES

Consider an isolated system with stress-energy tensor T μ ν T μ ν T^(mu nu)T^{\mu \nu}Tμν, total 4-momentum P α P α P^(alpha)P^{\alpha}Pα, magnitude of 4 -momentum M = ( P P ) 1 / 2 M = ( P P ) 1 / 2 M=(-P*P)^(1//2)M=(-\boldsymbol{P} \cdot \boldsymbol{P})^{1 / 2}M=(PP)1/2, intrinsic angular momentum tensor S α β S α β S^(alpha beta)S^{\alpha \beta}Sαβ, and intrinsic angular momentum vector S α S α S^(alpha)S^{\alpha}Sα. (See Box 5.6.) An observer with 4 -velocity u α u α u^(alpha)u^{\alpha}uα defines the centroid of the system, at his Lorentz time x 0 = t x 0 = t x^(0)=tx^{0}=tx0=t and in his own Lorentz frame, by
(5.56) X u j ( t ) = ( 1 / P 0 ) x 0 = t x j T 00 d 3 x in Lorentz frame where u = P / x 0 (5.56) X u j ( t ) = 1 / P 0 x 0 = t x j T 00 d 3 x  in Lorentz frame where  u = P / x 0 {:(5.56)X_(u)^(j)(t)=(1//P^(0))int_(x^(0)=t)x^(j)T^(00)d^(3)x quad" in Lorentz frame where "u=delP//delx^(0):}\begin{equation*} X_{u}^{j}(t)=\left(1 / P^{0}\right) \int_{x^{0}=t} x^{j} T^{00} d^{3} x \quad \text { in Lorentz frame where } \boldsymbol{u}=\partial \mathscr{P} / \partial x^{0} \tag{5.56} \end{equation*}(5.56)Xuj(t)=(1/P0)x0=txjT00d3x in Lorentz frame where u=P/x0
This centroid depends on (i) the particular system being studied, (ii) the 4 -velocity u u u\boldsymbol{u}u of the observer, and (iii) the time t t ttt at which the system is observed.
(a) Show that the centroid moves with a uniform velocity
(5.57) d X u j / d t = P j / P 0 (5.57) d X u j / d t = P j / P 0 {:(5.57)dX_(u)^(j)//dt=P^(j)//P^(0):}\begin{equation*} d X_{u}^{j} / d t=P^{j} / P^{0} \tag{5.57} \end{equation*}(5.57)dXuj/dt=Pj/P0
corresponding to the 4 -velocity
( ) U = P / M ( ) U = P / M {:('")"U=P//M:}\begin{equation*} \boldsymbol{U}=\boldsymbol{P} / M \tag{$\prime$} \end{equation*}()U=P/M
Note that this " 4 -velocity of centroid" is independent of the 4 -velocity u u u\boldsymbol{u}u used in defining the centroid.
(b) The centroid associated with the rest frame of the system (i.e., the centroid defined with u = U u = U u=U\boldsymbol{u}=\boldsymbol{U}u=U ) is called the center of mass; see Box 5.6 . Let ξ u ξ u xi_(u)\xi_{u}ξu be a vector reaching from any event on the center-of-mass world line to any event on the world line of the centroid associated with 4 -velocity u u u\boldsymbol{u}u; thus the components of ξ u ξ u xi_(u)\boldsymbol{\xi}_{\boldsymbol{u}}ξu in any coordinate system are
(5.58) ξ u α = X u α X U α (5.58) ξ u α = X u α X U α {:(5.58)xi_(u)^(alpha)=X_(u)^(alpha)-X_(U)^(alpha):}\begin{equation*} \xi_{u}^{\alpha}=X_{u}^{\alpha}-X_{\boldsymbol{U}}^{\alpha} \tag{5.58} \end{equation*}(5.58)ξuα=XuαXUα
Show that ξ u ξ u xi_(u)\xi_{u}ξu satisfies the equation
(5.59) [ ( ξ u α P β P α ξ u β ) S α β ] u β = 0 (5.59) ξ u α P β P α ξ u β S α β u β = 0 {:(5.59)[(xi_(u)^(alpha)P^(beta)-P^(alpha)xi_(u)^(beta))-S^(alpha beta)]u_(beta)=0:}\begin{equation*} \left[\left(\xi_{u}^{\alpha} P^{\beta}-P^{\alpha} \xi_{u}^{\beta}\right)-S^{\alpha \beta}\right] u_{\beta}=0 \tag{5.59} \end{equation*}(5.59)[(ξuαPβPαξuβ)Sαβ]uβ=0
[Hint: perform the calculation in a Lorentz frame where u = P / x 0 u = P / x 0 u=delP//delx^(0)\boldsymbol{u}=\partial \mathscr{P} / \partial x^{0}u=P/x0.]
(c) Show that, as seen in the rest-frame of the system at any given moment of time, the above equation reduces to the three-dimensional Euclidean equation
( ) ξ u = ( v × S ) / M ( ) ξ u = ( v × S ) / M {:('")"xi_(u)=-(v xx S)//M:}\begin{equation*} \xi_{u}=-(v \times S) / M \tag{$\prime$} \end{equation*}()ξu=(v×S)/M
where v = u / u 0 v = u / u 0 v=u//u^(0)\boldsymbol{v}=\boldsymbol{u} / u^{0}v=u/u0 is the ordinary velocity of the frame associated with the centroid.
(d) Assume that the energy density measured by any observer anywhere in spacetime is
non-negative ( u T u 0 ( u T u 0 (u*T*u >= 0(\boldsymbol{u} \cdot \boldsymbol{T} \cdot \boldsymbol{u} \geq 0(uTu0 for all timelike u ) u ) u)\boldsymbol{u})u). In the rest frame of the system, construct the smallest possible cylinder that is parallel to S S S\boldsymbol{S}S and that contains the entire system ( T α β = 0 T α β = 0 T_(alpha beta)=0T_{\alpha \beta}=0Tαβ=0 everywhere outside the cylinder). Show that the radius r 0 r 0 r_(0)r_{0}r0 of this cylinder is limited by
(5.60) r 0 | S | / M (5.60) r 0 | S | / M {:(5.60)r_(0) >= |S|//M:}\begin{equation*} r_{0} \geq|\boldsymbol{S}| / M \tag{5.60} \end{equation*}(5.60)r0|S|/M
Thus, a system with given intrinsic angular momentum S S S\boldsymbol{S}S and given mass M M MMM has a minimum possible size r 0 min = | S | / M r 0  min  = | S | / M r_(0" min ")=|S|//Mr_{0 \text { min }}=|\boldsymbol{S}| / Mr0 min =|S|/M as measured in its rest frame.

синотет 6

ACCELERATED OBSERVERS

The objective world simply is; it does not happen. Only to the gaze of my consciousness, crawling upward along the life line [world line] of my body, does a section of this world come to life as a fleeting image in space which continuously changes in time.

§6.1. ACCELERATED OBSERVERS CAN BE ANALYZED USING SPECIAL RELATIVITY

It helps in analyzing gravitation to consider a situation where gravity is mocked up by acceleration. Focus attention on a region so far from any attracting matter, and so free of disturbance, that (to some proposed degree of precision) spacetime there can be considered to be flat and to have Lorentz geometry. Let the observer acquire the feeling that he is subject to gravity, either because of jet rockets strapped to his legs or because he is in a rocket-driven spaceship. How does physics look to him?
Dare one answer this question? At this early stage in the book, is one not too ignorant of gravitation physics to predict what physical effects will be measured by an observer who thinks he is in a gravitational field, although he is really in an accelerated spaceship? Quite the contrary; special relativity was developed precisely to predict the physics of accelerated objects-e.g., the radiation from an accelerated charge. Even the fantastic accelerations
a nuclear v 2 / r 10 31 cm / sec 2 10 28 "earth gravities" a nuclear  v 2 / r 10 31 cm / sec 2 10 28  "earth gravities"  a_("nuclear ")∼v^(2)//r∼10^(31)cm//sec^(2)∼10^(28)" "earth gravities" "a_{\text {nuclear }} \sim v^{2} / r \sim 10^{31} \mathrm{~cm} / \mathrm{sec}^{2} \sim 10^{28} \text { "earth gravities" }anuclear v2/r1031 cm/sec21028 "earth gravities" 
suffered by a neutron bound in a nucleus, and the even greater accelerations met in high-energy particle-scattering events, are routinely and accurately treated within
Accelerated motion and accelerated observers can be analyzed using special relativity

Box 6.1 GENERAL RELATIVITY IS BUILT ON SPECIAL RELATIVITY

A tourist in a powered interplanetary rocket feels "gravity." Can a physicist by local effects convince him that this "gravity" is bogus? Never, says Einstein's principle of the local equivalence of gravity and accelerations. But then the physicist will make no errors if he deludes himself into treating true gravity as a local illusion caused by acceleration. Under this delusion, he barges ahead and solves gravitational problems by using special relativity: if he is clever enough to divide every problem into a network of local questions, each solvable under such a delusion, then he can work out all influ-
ences of any gravitational field. Only three basic principles are invoked: special-relativity physics, the equivalence principle, and the local nature of physics. They are simple and clear. To apply them, however, imposes a double task: (1) take spacetime apart into locally flat pieces (where the principles are valid), and (2) put these pieces together again into a comprehensible picture. To undertake this dissection and reconstitution, to see curved dynamic spacetime inescapably take form, and to see the consequences for physics: that is general relativity.
the framework of special relativity. The theoretician who confidently applies special relativity to antiproton annihilations and strange-particle resonances is not about to be frightened off by the mere illusions of a rocket passenger who gullibly believed the travel brochures advertising "earth gravity all the way." When spacetime is flat, move however one will, special relativity can handle the job. (It can handle bigger jobs too; see Box 6.1.) The essential features of how special relativity handles the job are summarized in Box 6.2 for the benefit of the Track-1 reader, who can skip the rest of the chapter, and also for the benefit of the Track- 2 reader, who will find it useful background for the rest of the chapter.

Box 6.2 ACCELERATED OBSERVERS IN BRIEF

An accelerated observer can carry clocks and measuring rods with him, and can use them to set up a reference frame (coordinate system) in his neighborhood.
His clocks, if carefully chosen so their structures are affected negligibly by acceleration (e.g., atomic clocks), will tick at the same rate as unaccelerated clocks moving momentarily along with him:
Δ τ ( time interval ticked off by observer's clocks as he moves a vector displacement ξ along his world line ) = [ g ( ξ , ξ ) ] 1 / 2 Δ τ  time interval ticked off   by observer's clocks as he   moves a vector displacement  ξ  along his world line  = [ g ( ξ , ξ ) ] 1 / 2 Delta tau-=([" time interval ticked off "],[" by observer's clocks as he "],[" moves a vector displacement "],[xi" along his world line "])=[-g(xi,xi)]^(1//2)\Delta \tau \equiv\left(\begin{array}{c} \text { time interval ticked off } \\ \text { by observer's clocks as he } \\ \text { moves a vector displacement } \\ \xi \text { along his world line } \end{array}\right)=[-\boldsymbol{g}(\xi, \xi)]^{1 / 2}Δτ( time interval ticked off  by observer's clocks as he  moves a vector displacement ξ along his world line )=[g(ξ,ξ)]1/2
And his rods, if chosen to be sufficiently rigid, will measure the same lengths as
momentarily comoving, unaccelerated rods do. (For further discussion, see § 16.4 § 16.4 §16.4\S 16.4§16.4, and Boxes 16.2 to 16.4.)
Let the observer's coordinate system be a Cartesian latticework of rods and clocks, with the origin of the lattice always on his world line. He must keep his latticework small:
l ( spatial dimensions of lattice ) ( the acceleration measured by accelerometers he carries ) 1 1 g . l (  spatial dimensions   of lattice  ) (  the acceleration measured   by accelerometers he carries  ) 1 1 g . l-=((" spatial dimensions ")/(" of lattice "))≪((" the acceleration measured ")/(" by accelerometers he carries "))^(-1)-=(1)/(g).l \equiv\binom{\text { spatial dimensions }}{\text { of lattice }} \ll\binom{\text { the acceleration measured }}{\text { by accelerometers he carries }}^{-1} \equiv \frac{1}{g} .l( spatial dimensions  of lattice )( the acceleration measured  by accelerometers he carries )11g.
At distances l l lll away from his world line, strange things of dimensionless magnitude g l g l glg lgl happen to his lattice-e.g., the acceleration measured by accelerometers differs from g g ggg by a fractional amount g l g l ∼gl\sim g lgl (exercise 6.7); also, clocks initially synchronized with the clock on his world line get out of step (tick at different rates) by a fractional amount g l g l ∼gl\sim g lgl (exercise 6.6). (Note that an acceleration of one "earth gravity" corresponds to
g 1 10 3 sec 2 / cm 10 18 cm 1 light-year g 1 10 3 sec 2 / cm 10 18 cm 1  light-year  g^(-1)∼10^(-3)sec^(2)//cm∼10^(18)cm∼1" light-year "g^{-1} \sim 10^{-3} \sec ^{2} / \mathrm{cm} \sim 10^{18} \mathrm{~cm} \sim 1 \text { light-year }g1103sec2/cm1018 cm1 light-year 
so the restriction l 1 / g l 1 / g l≪1//gl \ll 1 / gl1/g is normally not severe.)
To deduce the results of experiments and observations performed by an accelerated observer, one can analyze them in coordinate-independent, geometric terms, and then project the results onto the basis vectors of his accelerated frame. Alternatively, one can analyze the experiments and observations in a Lorentz frame, and then transform to the accelerated frame.
As deduced in this manner, the results of experiments performed locally (at l 1 / g ) l 1 / g ) l≪1//g)l \ll 1 / \mathrm{g})l1/g) by an accelerated observer differ from the results of the same experiments performed in a Lorentz frame in only three ways:
(1) There are complicated fractional differences of order g l 1 g l 1 gl≪1g l \ll 1gl1 mentioned above, that can be made negligible by making the accelerated frame small enough.
(2) There are Coriolis forces of precisely the same type as are encountered in Newtonian theory (exercise 6.8). These the observer can get rid of by carefully preventing his latticework from rotating-e.g., by tying it to gyroscopes that he accelerates with himself by means of forces applied to their centers of mass (no torque!). Such a nonrotating latticework has "Fermi-Walker transported" basis vectors (§6.5),
(1) d e α d τ = u ( a e α ) a ( u e α ) , (1) d e α d τ = u a e α a u e α , {:(1)(de_(alpha^(')))/(d tau)=u(a*e_(alpha^(')))-a(u*e_(alpha^(')))",":}\begin{equation*} \frac{d \boldsymbol{e}_{\alpha^{\prime}}}{d \tau}=\boldsymbol{u}\left(\boldsymbol{a} \cdot \boldsymbol{e}_{\alpha^{\prime}}\right)-\boldsymbol{a}\left(\boldsymbol{u} \cdot \boldsymbol{e}_{\alpha^{\prime}}\right), \tag{1} \end{equation*}(1)deαdτ=u(aeα)a(ueα),
where u = 4 u = 4 u=4\boldsymbol{u}=4u=4-velocity, and a = d u / d τ = 4 a = d u / d τ = 4 a=du//d tau=4\boldsymbol{a}=d \boldsymbol{u} / d \tau=4a=du/dτ=4-acceleration.
(3) There are inertial forces of precisely the same type as are encountered in Newtonian theory (exercise 6.8). These are due to the observer's acceleration, and he cannot get rid of them except by stopping his accelerating.

§6.2. HYPERBOLIC MOTION

Study a rocket passenger who feels "gravity" because he is being accelerated in flat spacetime. Begin by describing his motion relative to an inertial reference frame. His 4 -velocity satisfies the condition u 2 = 1 u 2 = 1 u^(2)=-1\boldsymbol{u}^{2}=-1u2=1. To say that it is fixed in magnitude is to say that the 4 -acceleration,
(6.1) a = d u / d τ (6.1) a = d u / d τ {:(6.1)a=du//d tau:}\begin{equation*} \boldsymbol{a}=d \boldsymbol{u} / d \tau \tag{6.1} \end{equation*}(6.1)a=du/dτ
is orthogonal to the 4 -velocity:
(6.2) 0 = ( d / d τ ) ( 1 / 2 ) = ( d / d τ ) ( 1 2 u u ) = a u (6.2) 0 = ( d / d τ ) ( 1 / 2 ) = ( d / d τ ) 1 2 u u = a u {:(6.2)0=(d//d tau)(-1//2)=(d//d tau)((1)/(2)u*u)=a*u:}\begin{equation*} 0=(d / d \tau)(-1 / 2)=(d / d \tau)\left(\frac{1}{2} \boldsymbol{u} \cdot \boldsymbol{u}\right)=\boldsymbol{a} \cdot \boldsymbol{u} \tag{6.2} \end{equation*}(6.2)0=(d/dτ)(1/2)=(d/dτ)(12uu)=au
This equation implies that a 0 = 0 a 0 = 0 a^(0)=0a^{0}=0a0=0 in the rest frame of the passenger (that Lorentz frame, where, at the instant in question, u = e 0 u = e 0 u=e_(0)\boldsymbol{u}=\boldsymbol{e}_{0}u=e0 ); in this frame the space components of a μ a μ a^(mu)a^{\mu}aμ reduce to the ordinary definition of acceleration, a i = d 2 x i / d t 2 a i = d 2 x i / d t 2 a^(i)=d^(2)x^(i)//dt^(2)a^{i}=d^{2} x^{i} / d t^{2}ai=d2xi/dt2. From the components a μ = ( 0 ; a i ) a μ = 0 ; a i a^(mu)=(0;a^(i))a^{\mu}=\left(0 ; a^{i}\right)aμ=(0;ai) in the rest frame, then, one sees that the magnitude of the acceleration in the rest frame can be computed as the simple invariant
a 2 = a μ a μ = ( d 2 x / d t 2 ) 2 as measured in rest frame a 2 = a μ a μ = d 2 x / d t 2 2  as measured in rest frame  a^(2)=a^(mu)a_(mu)=(d^(2)x//dt^(2))^(2)" as measured in rest frame "\boldsymbol{a}^{2}=a^{\mu} a_{\mu}=\left(d^{2} \boldsymbol{x} / d t^{2}\right)^{2} \text { as measured in rest frame }a2=aμaμ=(d2x/dt2)2 as measured in rest frame 
Consider, for simplicity, an observer who feels always a constant acceleration g g ggg. Take the acceleration to be in the x 1 x 1 x^(1)x^{1}x1 direction of some inertial frame, and take x 2 = x 3 = 0 x 2 = x 3 = 0 x^(2)=x^(3)=0x^{2}=x^{3}=0x2=x3=0. The equations for the motion of the observer in that inertial frame become
(6.3) d t d τ = u 0 , d x d τ = u 1 ; d u 0 d τ = a 0 , d u 1 d τ = a 1 (6.3) d t d τ = u 0 , d x d τ = u 1 ; d u 0 d τ = a 0 , d u 1 d τ = a 1 {:(6.3)(dt)/(d tau)=u^(0)","quad(dx)/(d tau)=u^(1);quad(du^(0))/(d tau)=a^(0)","quad(du^(1))/(d tau)=a^(1):}\begin{equation*} \frac{d t}{d \tau}=u^{0}, \quad \frac{d x}{d \tau}=u^{1} ; \quad \frac{d u^{0}}{d \tau}=a^{0}, \quad \frac{d u^{1}}{d \tau}=a^{1} \tag{6.3} \end{equation*}(6.3)dtdτ=u0,dxdτ=u1;du0dτ=a0,du1dτ=a1
Write out the three algebraic equations
u μ u μ = 1 u μ a μ = u 0 a 0 + u 1 a 1 = 0 u μ u μ = 1 u μ a μ = u 0 a 0 + u 1 a 1 = 0 {:[u^(mu)u_(mu)=-1],[u^(mu)a_(mu)=-u^(0)a^(0)+u^(1)a^(1)=0]:}\begin{aligned} & u^{\mu} u_{\mu}=-1 \\ & u^{\mu} a_{\mu}=-u^{0} a^{0}+u^{1} a^{1}=0 \end{aligned}uμuμ=1uμaμ=u0a0+u1a1=0
and
a μ a μ = g 2 . a μ a μ = g 2 . a^(mu)a_(mu)=g^(2).a^{\mu} a_{\mu}=g^{2} .aμaμ=g2.
Solve for the acceleration, finding
(6.4) a 0 = d u 0 d τ = g u 1 , a 1 = d u 1 d τ = g u 0 . (6.4) a 0 = d u 0 d τ = g u 1 , a 1 = d u 1 d τ = g u 0 . {:(6.4)a^(0)=(du^(0))/(d tau)=gu^(1)","quada^(1)=(du^(1))/(d tau)=gu^(0).:}\begin{equation*} a^{0}=\frac{d u^{0}}{d \tau}=g u^{1}, \quad a^{1}=\frac{d u^{1}}{d \tau}=g u^{0} . \tag{6.4} \end{equation*}(6.4)a0=du0dτ=gu1,a1=du1dτ=gu0.
These linear differential equations can be solved immediately. The solution, with a suitable choice of the origin, reads
(6.5) t = g 1 sinh g τ , x = g 1 cosh g τ (6.5) t = g 1 sinh g τ , x = g 1 cosh g τ {:(6.5)t=g^(-1)sinh g tau","quad x=g^(-1)cosh g tau:}\begin{equation*} t=g^{-1} \sinh g \tau, \quad x=g^{-1} \cosh g \tau \tag{6.5} \end{equation*}(6.5)t=g1sinhgτ,x=g1coshgτ
Note that x 2 t 2 = g 2 x 2 t 2 = g 2 x^(2)-t^(2)=g^(-2)x^{2}-t^{2}=g^{-2}x2t2=g2. The world line is a hyperbola in a spacetime diagram ("hyperbolic motion"; Figure 6.1). Several interesting aspects of this motion are
Figure 6.1.
Hyperbolic motion. World line of an object that (or an observer who) experiences always a fixed acceleration g g ggg with respect to an inertial frame that is instantaneously comoving (different inertial frames at different instants!). The 4 -acceleration a a a\boldsymbol{a}a is everywhere orthogonal (Lorentz geometry!) to the 4-velocity u u u\boldsymbol{u}u.
treated in the exercises. Let the magnitude of the constant acceleration g g ggg be the acceleration of gravity, g = 980 cm / sec 2 g = 980 cm / sec 2 g=980cm//sec^(2)g=980 \mathrm{~cm} / \mathrm{sec}^{2}g=980 cm/sec2 experienced on earth: g ( 10 3 cm / sec 2 ) g 10 3 cm / sec 2 g≃(10^(3)(cm)//sec^(2))g \simeq\left(10^{3} \mathrm{~cm} / \mathrm{sec}^{2}\right)g(103 cm/sec2) / ( 3 × 10 10 cm / sec ) 2 = ( 3 × 10 7 sec 3 × 10 10 cm / sec ) 1 = ( 1 light-year ) 1 3 × 10 10 cm / sec 2 = 3 × 10 7 sec 3 × 10 10 cm / sec 1 = ( 1  light-year  ) 1 (3xx10^(10)(cm)//sec)^(2)=(3xx10^(7)sec*3xx10^(10)(cm)//sec)^(-1)=(1" light-year ")^(-1)\left(3 \times 10^{10} \mathrm{~cm} / \mathrm{sec}\right)^{2}=\left(3 \times 10^{7} \mathrm{sec} \cdot 3 \times 10^{10} \mathrm{~cm} / \mathrm{sec}\right)^{-1}=(1 \text { light-year })^{-1}(3×1010 cm/sec)2=(3×107sec3×1010 cm/sec)1=(1 light-year )1. Thus the observer will attain relativistic velocities after maintaining this acceleration for something like one year of his own proper time. He can outrun a photon if he has a head start on it of one light-year or more.

Exercise 6.1. A TRIP TO THE GALACTIC NUCLEUS

Compute the proper time required for the occupants of a rocket ship to travel the 30 , 000 30 , 000 ∼30,000\sim 30,00030,000 light-years from the Earth to the center of the Galaxy. Assume that they maintain an acceleration of one "earth gravity" ( 10 3 cm / sec 2 ) 10 3 cm / sec 2 (10^(3)(cm)//sec^(2))\left(10^{3} \mathrm{~cm} / \mathrm{sec}^{2}\right)(103 cm/sec2) for half the trip, and then decelerate at one earth gravity for the remaining half.

Exercise 6.2. ROCKET PAYLOAD

What fraction of the initial mass of the rocket can be payload for the journey considered in exercise 6.1? Assume an ideal rocket that converts rest mass into radiation and ejects all the radiation out the back of the rocket with 100 per cent efficiency and perfect collimation.

Exercise 6.3. TWIN PARADOX

(a) Show that, of all timelike world lines connecting two events C C C\mathscr{C}C and B B B\mathscr{B}B, the one with the longest lapse of proper time is the unaccelerated one. (Hint: perform the calculation in the inertial frame of the unaccelerated world line.)
(b) One twin chooses to move from C C C\mathscr{C}C to B B B\mathscr{B}B along the unaccelerated world line. Show that the other twin, by an appropriate choice of accelerations, can get from C C C\mathscr{C}C to B B B\mathscr{B}B in arbitrarily small proper time.
(c) If the second twin prefers to ride in comfort, with the acceleration he feels never exceeding one earth gravity, g g ggg, what is the shortest proper time-lapse he can achieve between a a a\mathfrak{a}a and B B B\mathscr{B}B ? Express the answer in terms of g g ggg and the proper time-lapse Δ τ Δ τ Delta tau\Delta \tauΔτ measured by the unaccelerated twin.
(d) Evaluate the answer numerically for several interesting trips.

Exercise 6.4. RADAR SPEED INDICATOR

A radar set measures velocity by emitting a signal at a standard frequency and comparing it with the frequency of the signal reflected back by another object. This redshift measurement is then converted, using the standard special-relativistic formula, into the corresponding velocity, and the radar reads out this velocity. How useful is this radar set as a velocity-measuring instrument for a uniformly accelerated observer?
(a) Consider this problem first for the special case where the object and the radar set are at rest with respect to each other at the instant the radar pulse is reflected. Compute the redshift 1 + z = ω e / ω 0 1 + z = ω e / ω 0 1+z=omega_(e)//omega_(0)1+z=\omega_{e} / \omega_{0}1+z=ωe/ω0 that the radar set measures in this case, and the resulting (incorrect) velocity it infers. Simplify by making use of the symmetries of the situation.
(b) Now consider the situation where the object has a non-zero velocity in the momentary rest frame of the observer at the instant it reflects the radar pulse. Compute the ratio of the actual relative velocity to the velocity read out by the radar set.

Exercise 6.5. RADAR DISTANCE INDICATOR

Use radar as a distance-measuring device. The radar set measures its proper time τ τ tau\tauτ between the instant at which it emits a pulse and the later instant when it receives the reflected pulse. It then performs the simple computation L 0 = τ / 2 L 0 = τ / 2 L_(0)=tau//2L_{0}=\tau / 2L0=τ/2 and supplies as output the "distance" L 0 L 0 L_(0)L_{0}L0. How accurate is the output reading of the radar set for measuring the actual distance L L LLL to the object, when used by a uniformly accelerated observer? ( L L LLL is defined as the distance in the momentary rest frame of the observer at the instant the pulse is reflected, which is at the observer's proper time halfway between emitting and receiving the pulse.) Give a correct formula relating L 0 τ / 2 L 0 τ / 2 L_(0)-=tau//2L_{0} \equiv \tau / 2L0τ/2 to the actual distance L L LLL. Show that the reading L 0 L 0 L_(0)L_{0}L0 becomes infinite as L L LLL approaches g 1 g 1 g^(-1)g^{-1}g1, where g g ggg is the observer's acceleration, as measured by an accelerometer he carries.
Difficulties in constructing "the coordinate system of an accelerated observer":
Breakdown in communication between observer and events at distance
l > l > l >l>l> (acceleration 1 1 ^(-1)^{-1}1

§6.3. CONSTRAINTS ON SIZE OF AN ACCELERATED FRAME

It is very easy to put together the words "the coordinate system of an accelerated observer," but it is much harder to find a concept these words might refer to. The most useful first remark one can make about these words is that, if taken seriously, they are self-contradictory. The definite article "the" in this phrase suggests that one is thinking of some unique coordinate system naturally associated with some specified accelerated observer, such as one whose world line is given in equation (6.5). If the coordinate system is indeed natural, one would expect that the coordinates of any event could be determined by a sufficiently ingenious observer by sending and receiving light signals. But from Figure 6.1 it is clear that the events composing one quarter of all spacetime (Zone III) can neither send light signals to, nor receive light signals from, the specified observer. Another half of spacetime suffers lesser disabilities in this respect: Zone II cannot send to the observer, Zone IV cannot receive from him. It is hard to see how the observer could define in any natural way a coordinate system covering events with which he has no causal relationship, which he cannot see, and from which he cannot be seen!
Difficulties also occur when one considers an observer who begins at rest in one frame, is accelerated for a time, and maintains thereafter a constant velocity, at rest in some other inertial coordinate system. Do his motions define in any natural way
Figure 6.2.
World line of an observer who has undergone a brief period of acceleration. In each phase of motion at constant velocity, an inertial coordinate system can be set up. However, there is no way to reconcile these discordant coordinates in the region of overlap (beginning at distance g 1 g 1 g^(-1)g^{-1}g1 to the left of the region of acceleration).
a coordinate system? Then this coordinate system (1) should be the inertial frame x μ x μ x^(mu)x^{\mu}xμ in which he was at rest for times x 0 x 0 x^(0)x^{0}x0 less than 0 , and (2) should be the other inertial frame x μ x μ x^(mu^('))x^{\mu^{\prime}}xμ for times x 0 > T x 0 > T x^(0^(')) > T^(')x^{0^{\prime}}>T^{\prime}x0>T in which he was at rest in that other frame. Evidently some further thinking would be required to decide how to define the coordinates in the regions not determined by these two conditions (Figure 6.2). More serious, however, is the fact that these two conditions are inconsistent for a region of spacetime that satisfies simultaneously x 0 < 0 x 0 < 0 x^(0) < 0x^{0}<0x0<0 and x 0 > T x 0 > T x^(0^(')) > T^(')x^{0^{\prime}}>T^{\prime}x0>T. In both examples of accelerated motion (Figures 6.1 and 6.2), the serious difficulties about defining a coordinate system begin only at a finite distance g 1 g 1 g^(-1)g^{-1}g1 from the world line of the accelerated observer. The problem evidently has no solution for distances from the world line greater than g 1 g 1 g^(-1)g^{-1}g1. It does possess a natural solution in the immediate vicinity of the observer. This solution goes under the name of "Fermi-Walker transported orthonormal tetrad." The essential idea lends itself to simple illustration for hyperbolic motion, as follows.

§6.4. THE TETRAD CARRIED BY A UNIFORMLY ACCELERATED OBSERVER

An infinitesimal version of a coordinate system is supplied by a "tetrad," or "moving frame" (Cartan's "repère mobile"), or set of basis vectors e 0 , e 1 , e 2 , e 3 e 0 , e 1 , e 2 , e 3 e_(0^(')),e_(1^(')),e_(2^(')),e_(3^('))\boldsymbol{e}_{0^{\prime}}, \boldsymbol{e}_{1^{\prime}}, \boldsymbol{e}_{2^{\prime}}, \boldsymbol{e}_{3^{\prime}}e0,e1,e2,e3 (subscript tells which vector, not which component of one vector!) Let the time axis be the time axis of a comoving inertial frame in which the observer is momentarily at rest. Thus the zeroth basis vector is identical with his 4-velocity: e 0 = u e 0 = u e_(0^('))=u\boldsymbol{e}_{0^{\prime}}=\boldsymbol{u}e0=u. The space axes e 2 e 2 e_(2)\boldsymbol{e}_{2}e2 and e 3 e 3 e_(3)\boldsymbol{e}_{3}e3 are not affected by Lorentz transformations in the 1-direction. Therefore take e 2 e 2 e_(2^('))\boldsymbol{e}_{2^{\prime}}e2 and e 3 e 3 e_(3^('))\boldsymbol{e}_{3^{\prime}}e3 to be the unit basis vectors of the all-encompassing Lorentz frame relative to which the hyperbolic motion of the observer has already been described in equations (6.5): e 2 = e 2 ; e 3 = e 3 e 2 = e 2 ; e 3 = e 3 e_(2^('))=e_(2);e_(3^('))=e_(3)\boldsymbol{e}_{2^{\prime}}=\boldsymbol{e}_{2} ; \boldsymbol{e}_{3^{\prime}}=\boldsymbol{e}_{3}e2=e2;e3=e3. The remaining basis vector, e 1 e 1 e_(1^('))\boldsymbol{e}_{1^{\prime}}e1, orthogonal to the other three, is parallel to the acceleration vector, e 1 = g 1 a e 1 = g 1 a e_(1^('))=g^(-1)a\boldsymbol{e}_{1^{\prime}}=g^{-1} \boldsymbol{a}e1=g1a [see equation (6.4)]. There is a more satisfactory way to characterize this moving frame: the time axis e 0 e 0 e_(0^('))\boldsymbol{e}_{0^{\prime}}e0 is the observer's 4 -velocity, so he is always at rest in this frame; and the
Natural coordinates inconsistent at distance l > ( acceleration ) 1 l > (  acceleration  ) 1 l > (" acceleration ")^(-1)l>(\text { acceleration })^{-1}l>( acceleration )1
Orthonormal tetrad of basis vectors carried by uniformly accelerated observer
other three vectors e 1 e 1 e_(1^('))\boldsymbol{e}_{1^{\prime}}e1 are chosen in such a way as to be (1) orthogonal and (2) nonrotating. These basis vectors are:
( e 0 ) μ = ( cosh g τ ; sinh g τ , 0 , 0 ) ; ( e 1 ) μ = ( sinh g τ ; cosh g τ , 0 , 0 ) ; ( e 2 2 ) μ = ( 0 ; 0 , 1 , 0 ) ; (6.6) ( e 3 3 ) μ = ( 0 ; 0 , 0 , 1 ) . e 0 μ = ( cosh g τ ; sinh g τ , 0 , 0 ) ; e 1 μ = ( sinh g τ ; cosh g τ , 0 , 0 ) ; e 2 2 μ = ( 0 ; 0 , 1 , 0 ) ; (6.6) e 3 3 μ = ( 0 ; 0 , 0 , 1 ) . {:[(e_(0^(')))^(mu)=(cosh g tau;sinh g tau","0","0);],[(e_(1^(')))^(mu)=(sinh g tau;cosh g tau","0","0);],[(e_(2^(2)))^(mu)=(0;0","1","0);],[(6.6)(e_(3^(3)))^(mu)=(0;0","0","1).]:}\begin{align*} & \left(e_{0^{\prime}}\right)^{\mu}=(\cosh g \tau ; \sinh g \tau, 0,0) ; \\ & \left(e_{1^{\prime}}\right)^{\mu}=(\sinh g \tau ; \cosh g \tau, 0,0) ; \\ & \left(e_{2^{2}}\right)^{\mu}=(0 ; 0,1,0) ; \\ & \left(e_{3^{3}}\right)^{\mu}=(0 ; 0,0,1) . \tag{6.6} \end{align*}(e0)μ=(coshgτ;sinhgτ,0,0);(e1)μ=(sinhgτ;coshgτ,0,0);(e22)μ=(0;0,1,0);(6.6)(e33)μ=(0;0,0,1).
There is a simple prescription to obtain these four basis vectors. Take the four basis vectors e 0 , e 1 , e 2 , e 3 e 0 , e 1 , e 2 , e 3 e_(0),e_(1),e_(2),e_(3)\boldsymbol{e}_{0}, \boldsymbol{e}_{1}, \boldsymbol{e}_{2}, \boldsymbol{e}_{3}e0,e1,e2,e3 of the original global Lorentz reference frame, and apply to them a simple boost in the 1-direction, of such a magnitude that e 0 e 0 e_(0^('))\boldsymbol{e}_{0^{\prime}}e0 comes into coincidence with the 4 -velocity of the observer. The fact that these vectors are all orthogonal to each other and of unit magnitude is formally stated by the equation
(6.7) e μ e ν = η μ ν (6.7) e μ e ν = η μ ν {:(6.7)e_(mu^('))*e_(nu^('))=eta_(mu^(')nu^(')):}\begin{equation*} \boldsymbol{e}_{\mu^{\prime}} \cdot \boldsymbol{e}_{\nu^{\prime}}=\eta_{\mu^{\prime} \nu^{\prime}} \tag{6.7} \end{equation*}(6.7)eμeν=ημν

§6.5. THE TETRAD FERMI-WALKER TRANSPORTED BY AN OBSERVER WITH ARBITRARY ACCELERATION

Turn now from an observer, or an object, in hyperbolic motion to one whose acceleration, always finite, varies arbitrarily with time. Here also we impose three criteria on the moving, infinitesimal reference frame, or tetrad: (1) the basis vectors e μ e μ e_(mu^('))\boldsymbol{e}_{\mu^{\prime}}eμ of the tetrad must remain orthonormal [equation (6.7)]; (2) the basis vectors must form a rest frame for the observer at each instant ( e 0 = u ) e 0 = u (e_(0^('))=u)\left(\boldsymbol{e}_{0^{\prime}}=\boldsymbol{u}\right)(e0=u); and (3) the tetrad should be "nonrotating."
This last criterion requires discussion. The basis vectors of the tetrad at any proper time τ τ tau\tauτ must be related to the basis vectors e 0 , e 1 , e 2 , e 3 e 0 , e 1 , e 2 , e 3 e_(0),e_(1),e_(2),e_(3)\boldsymbol{e}_{0}, \boldsymbol{e}_{1}, \boldsymbol{e}_{2}, \boldsymbol{e}_{3}e0,e1,e2,e3 of some given inertial frame by a Lorentz transformation e μ ( τ ) = Λ ν μ ( τ ) e p e μ ( τ ) = Λ ν μ ( τ ) e p e_(mu^('))(tau)=Lambda^(nu)_(mu^('))(tau)e_(p^('))\boldsymbol{e}_{\mu^{\prime}}(\tau)=\Lambda^{\nu}{ }_{\mu^{\prime}}(\tau) \boldsymbol{e}_{p^{\prime}}eμ(τ)=Λνμ(τ)ep. Therefore the basis vectors at two successive instants must also be related to each other by a Lorentz transformation. But a Lorentz transformation can be thought of as a "rotation" in spacetime. The 4 -velocity u u u\boldsymbol{u}u, always of unit magnitude, changes in direction. The very concept of acceleration therefore implies "rotation" of velocity 4 -vector. How then is the requirement of "no rotation" to be interpreted? Demand that the tetrad e μ ( τ ) e μ ( τ ) e_(mu^('))(tau)\boldsymbol{e}_{\mu^{\prime}}(\tau)eμ(τ) change from instant to instant by precisely that amount implied by the rate of change of u = e 0 u = e 0 u=e_(0)\boldsymbol{u}=\boldsymbol{e}_{0}u=e0, and by no additional arbitrary rotation. In other words, (1) accept the inevitable pseudorotation in the timelike plane defined by the velocity 4 -vector and the acceleration, but (2) rule out any ordinary rotation of the three space vectors.
Nonrelativistic physics describes the rotation of a vector (components v i v i v_(i)v_{i}vi ) by an instantaneous angular velocity vector (components ω i ω i omega_(i)\omega_{i}ωi ). This angular velocity appears in the formula for the rate of change of v v vvv,
(6.8) ( d v i / d t ) = ( ω × v ) i = ε i j k ω j v k (6.8) d v i / d t = ( ω × v ) i = ε i j k ω j v k {:(6.8)(dv_(i)//dt)=(omega xx v)_(i)=epsi_(ijk)omega_(j)v_(k):}\begin{equation*} \left(d v_{i} / d t\right)=(\omega \times v)_{i}=\varepsilon_{i j k} \omega_{j} v_{k} \tag{6.8} \end{equation*}(6.8)(dvi/dt)=(ω×v)i=εijkωjvk
For the extension to four-dimensional spacetime, it is helpful to think of the rotation
as occurring in the plane perpendicular to the angular velocity vector ω ω omega\omegaω. Thus rewrite (6.8) as
(6.9) d v i / d t = Ω i k v k (6.9) d v i / d t = Ω i k v k {:(6.9)dv_(i)//dt=-Omega_(ik)v_(k):}\begin{equation*} d v_{i} / d t=-\Omega_{i k} v_{k} \tag{6.9} \end{equation*}(6.9)dvi/dt=Ωikvk
where
(6.10) Ω j k = Ω k j = ω i ε i j k (6.10) Ω j k = Ω k j = ω i ε i j k {:(6.10)Omega_(jk)=-Omega_(kj)=omega_(i)epsi_(ijk):}\begin{equation*} \Omega_{j k}=-\Omega_{k j}=\omega_{i} \varepsilon_{i j k} \tag{6.10} \end{equation*}(6.10)Ωjk=Ωkj=ωiεijk
has non-zero components only in the plane of the rotation. In other words, to speak of "a rotation in the ( 1 , 2 ) ( 1 , 2 ) (1,2)(1,2)(1,2)-plane" is more useful than to speak of a rotation about the 3 -axis. The concept of "plane of rotation" carries over to four dimensions. There a rotation in the ( 1 , 2 ) ( 1 , 2 ) (1,2)(1,2)(1,2)-plane will leave constant not only the v 3 v 3 v_(3)v_{3}v3 but also the v 0 v 0 v_(0)v_{0}v0 component of the velocity. The four-dimensional definition of a rotation is
(6.11) d v μ d τ = Ω μ ν v ν , with Ω μ ν = Ω ν μ . (6.11) d v μ d τ = Ω μ ν v ν ,  with  Ω μ ν = Ω ν μ . {:(6.11)(dv^(mu))/(d tau)=-Omega^(mu nu)v_(nu)","quad" with "quadOmega^(mu nu)=-Omega^(nu mu).:}\begin{equation*} \frac{d v^{\mu}}{d \tau}=-\Omega^{\mu \nu} v_{\nu}, \quad \text { with } \quad \Omega^{\mu \nu}=-\Omega^{\nu \mu} . \tag{6.11} \end{equation*}(6.11)dvμdτ=Ωμνvν, with Ωμν=Ωνμ.
To test the appropriateness of this definition of a generalized rotation or infinitesimal Lorentz transformation, verify that it leaves invariant the length of the 4 -vector:
(6.12) d ( v μ u μ ) / d τ = 2 v μ ( d v μ / d τ ) = 2 Ω μ ν v μ v ν = 0 (6.12) d v μ u μ / d τ = 2 v μ d v μ / d τ = 2 Ω μ ν v μ v ν = 0 {:(6.12)d(v_(mu)u^(mu))//d tau=2v_(mu)(dv^(mu)//d tau)=-2Omega^(mu nu)v_(mu)v_(nu)=0:}\begin{equation*} d\left(v_{\mu} u^{\mu}\right) / d \tau=2 v_{\mu}\left(d v^{\mu} / d \tau\right)=-2 \Omega^{\mu \nu} v_{\mu} v_{\nu}=0 \tag{6.12} \end{equation*}(6.12)d(vμuμ)/dτ=2vμ(dvμ/dτ)=2Ωμνvμvν=0
The last expression vanishes because Ω μ ν Ω μ ν Omega^(mu nu)\Omega^{\mu \nu}Ωμν is antisymmetric, whereas v μ v ν v μ v ν v_(mu)v_(nu)v_{\mu} v_{\nu}vμvν is symmetric. Note also that the antisymmetric tensor Ω μ ν Ω μ ν Omega^(mu nu)\Omega^{\mu \nu}Ωμν ("rotation matrix"; "infinitesimal Lorentz transformation") has 4 × 3 / 2 = 6 4 × 3 / 2 = 6 4xx3//2=64 \times 3 / 2=64×3/2=6 independent components. This number agrees with the number of components in a finite Lorentz transformation (three parameters for rotations, plus three parameters for the components of a boost). The "infinitesimal Lorentz transformation" here must (1) generate the appropriate Lorentz transformation in the timelike plane spanned by the 4 -velocity and the 4 -acceleration, and (2) exclude a rotation in any other plane, in particular, in any spacelike plane. The unique answer to these requirements is
(6.13) Ω μ ν = a μ u ν a ν u μ ; i.e., Ω = a u . (6.13) Ω μ ν = a μ u ν a ν u μ ;  i.e.,  Ω = a u . {:(6.13)Omega^(mu nu)=a^(mu)u^(nu)-a^(nu)u^(mu);quad" i.e., "Omega=a^^u.:}\begin{equation*} \Omega^{\mu \nu}=a^{\mu} u^{\nu}-a^{\nu} u^{\mu} ; \quad \text { i.e., } \Omega=\boldsymbol{a} \wedge \boldsymbol{u} . \tag{6.13} \end{equation*}(6.13)Ωμν=aμuνaνuμ; i.e., Ω=au.
Apply this rotation to a spacelike vector w w w\boldsymbol{w}w orthogonal to u u u\boldsymbol{u}u and a , ( u w = 0 a , ( u w = 0 a,(u*w=0\boldsymbol{a},(\boldsymbol{u} \cdot \boldsymbol{w}=0a,(uw=0 and a w = 0 a w = 0 a*w=0\boldsymbol{a} \cdot \boldsymbol{w}=0aw=0 ). Immediately compute Ω μ ν w v = 0 Ω μ ν w v = 0 Omega^(mu nu)w_(v)=0\Omega^{\mu \nu} w_{v}=0Ωμνwv=0. Thus verify the absence of any space rotation. Now check the over-all normalization of Ω μ ν Ω μ ν Omega^(mu nu)\Omega^{\mu \nu}Ωμν in equation (6.13). Apply the infinitesimal Lorentz transformation to the velocity 4 -vector u u u\boldsymbol{u}u of the observer. Thus insert v μ = u μ v μ = u μ v^(mu)=u^(mu)v^{\mu}=u^{\mu}vμ=uμ in (6.11). It then reads
d u μ / d τ a μ = u μ ( a ν u ν ) a μ ( u ν u ν ) = a μ . d u μ / d τ a μ = u μ a ν u ν a μ u ν u ν = a μ . du^(mu)//d tau-=a^(mu)=u^(mu)(a^(nu)u_(nu))-a^(mu)(u^(nu)u_(nu))=a^(mu).d u^{\mu} / d \tau \equiv a^{\mu}=u^{\mu}\left(a^{\nu} u_{\nu}\right)-a^{\mu}\left(u^{\nu} u_{\nu}\right)=a^{\mu} .duμ/dτaμ=uμ(aνuν)aμ(uνuν)=aμ.
This result is an identity, since u u = 1 u u = 1 u*u=-1\boldsymbol{u} \cdot \boldsymbol{u}=-1uu=1 and u a = 0 u a = 0 u*a=0\boldsymbol{u} \cdot \boldsymbol{a}=0ua=0.
A vector v v v\boldsymbol{v}v that undergoes the indicated infinitesimal Lorentz transformation,
(6.14) d v μ / d τ = ( u μ a ν u ν a μ ) v p (6.14) d v μ / d τ = u μ a ν u ν a μ v p {:(6.14)dv^(mu)//d tau=(u^(mu)a^(nu)-u^(nu)a^(mu))v_(p):}\begin{equation*} d v^{\mu} / d \tau=\left(u^{\mu} a^{\nu}-u^{\nu} a^{\mu}\right) v_{p} \tag{6.14} \end{equation*}(6.14)dvμ/dτ=(uμaνuνaμ)vp
Fermi-Walker law of transport for "nonrotating" tetrad of basis vectors carried by an accelerated observer
is said to experience "Fermi-Walker transport" along the world line of the observer.
Mathematics of rotation in spacetime
Figure 6.3.
Construction of spacelike hyperplanes (dashed) orthogonal to the world line (heavy line) of an accelerated particle at selected moments along that world line. Note crossing of hyperplanes at distance g 1 ( τ ) g 1 ( τ ) g^(-1)(tau)g^{-1}(\tau)g1(τ) (time-dependent acceleration!) from the world line.
The natural moving frame associated with an accelerated observer consists of four orthonormal vectors, each of which is Fermi-Walker transported along the world line and one of which is e 0 = u e 0 = u e_(0^('))=u\boldsymbol{e}_{0^{\prime}}=\boldsymbol{u}e0=u (the 4 -velocity of the observer). Fermi-Walker transport of the space basis vectors e j e j e_(j^('))\boldsymbol{e}_{j^{\prime}}ej can be achieved in practice by attaching them to gyroscopes (see Box 6.2 and exercise 6.9).

§6.6. THE LOCAL COORDINATE SYSTEM OF AN ACCELERATED OBSERVER

Tetrad used to construct "local coordinate system of accelerated observer"
"coordinates relative to the accelerated observer." In detail, the prescription for the determination of these four coordinates consists of the four equations
(6.16) x μ = ξ k [ e k ( τ ) ] μ + z μ ( τ ) , (6.16) x μ = ξ k e k ( τ ) μ + z μ ( τ ) , {:(6.16)x^(mu)=xi^(k^('))[e_(k^('))(tau)]^(mu)+z^(mu)(tau)",":}\begin{equation*} x^{\mu}=\xi^{k^{\prime}}\left[e_{k^{\prime}}(\tau)\right]^{\mu}+z^{\mu}(\tau), \tag{6.16} \end{equation*}(6.16)xμ=ξk[ek(τ)]μ+zμ(τ),
in which the x μ x μ x^(mu)x^{\mu}xμ are considered as known, and the coordinates τ , ξ k τ , ξ k tau,xi^(k^('))\tau, \xi^{k^{\prime}}τ,ξk are considered unknowns.
At a certain distance from the accelerated world line, successive spacelike hyperplanes, instead of advancing with increasing τ τ tau\tauτ, will be retrogressing. At this distance, and at greater distances, the concept of "coordinates relative to the accelerated observer" becomes ambiguous and has to be abandoned. To evaluate this distance, note that any sufficiently short section of the world line can be approximated by a hyperbola ("hyperbolic motion with acceleration g g ggg "), where the time-dependent acceleration g ( τ ) g ( τ ) g(tau)g(\tau)g(τ) is given by the equation g 2 = a μ a μ g 2 = a μ a μ g^(2)=a^(mu)a_(mu)g^{2}=a^{\mu} a_{\mu}g2=aμaμ.
Apply the above general prescription to hyperbolic motion, arriving at the equations
x 0 = ( g 1 + ξ 1 ) sinh ( g ξ 0 ) , x 1 = ( g 1 + ξ 1 1 ) cosh ( g ξ ξ ) , x 2 = ξ 2 , (6.17) x 3 = ξ 3 . x 0 = g 1 + ξ 1 sinh g ξ 0 , x 1 = g 1 + ξ 1 1 cosh g ξ ξ , x 2 = ξ 2 , (6.17) x 3 = ξ 3 . {:[x^(0)=(g^(-1)+xi^(1))sinh(gxi^(0^(')))","],[x^(1)=(g^(-1)+xi^(1^(1)))cosh(gxi^(xi^(')))","],[x^(2)=xi^(2^('))","],[(6.17)x^(3)=xi^(3^(')).]:}\begin{align*} & x^{0}=\left(g^{-1}+\xi^{1}\right) \sinh \left(g \xi^{0^{\prime}}\right), \\ & x^{1}=\left(g^{-1}+\xi^{1^{1}}\right) \cosh \left(g \xi^{\xi^{\prime}}\right), \\ & x^{2}=\xi^{2^{\prime}}, \\ & x^{3}=\xi^{3^{\prime}} . \tag{6.17} \end{align*}x0=(g1+ξ1)sinh(gξ0),x1=(g1+ξ11)cosh(gξξ),x2=ξ2,(6.17)x3=ξ3.
The surfaces of constant ξ 0 ξ 0 xi^(0^('))\xi^{0^{\prime}}ξ0 are the hyperplanes with x 0 / x 1 = tanh g ξ g x 0 / x 1 = tanh g ξ g x^(0)//x^(1)=tanh gxi^(g^('))x^{0} / x^{1}=\tanh g \xi^{g^{\prime}}x0/x1=tanhgξg sketched in Figure 6.4. Substitute expressions (6.17) into the Minkowski formula for the line element to find
d s 2 = η μ ν d x μ d x ν (6.18) = ( 1 + g ξ 1 ) 2 ( d ξ ) 2 + ( d ξ 1 ) 2 + ( d ξ 2 ) 2 + ( d ξ 3 ) 2 . d s 2 = η μ ν d x μ d x ν (6.18) = 1 + g ξ 1 2 d ξ 2 + d ξ 1 2 + d ξ 2 2 + d ξ 3 2 . {:[ds^(2)=eta_(mu nu)dx^(mu)dx^(nu)],[(6.18)=-(1+gxi^(1))^(2)(dxi^('))^(2)+(dxi^(1))^(2)+(dxi^(2))^(2)+(dxi^(3))^(2).]:}\begin{align*} d s^{2} & =\eta_{\mu \nu} d x^{\mu} d x^{\nu} \\ & =-\left(1+g \xi^{1}\right)^{2}\left(d \xi^{\prime}\right)^{2}+\left(d \xi^{1}\right)^{2}+\left(d \xi^{2}\right)^{2}+\left(d \xi^{3}\right)^{2} . \tag{6.18} \end{align*}ds2=ημνdxμdxν(6.18)=(1+gξ1)2(dξ)2+(dξ1)2+(dξ2)2+(dξ3)2.
Figure 6.4.
Local coordinate system associated with an observer in hyperbolic motion (heavy black world line). The local coordinate system fails for ξ 1 ξ 1 xi^(1)\xi^{1}ξ1 less than g 1 g 1 -g^(-1)-g^{-1}g1.
The coefficients of d ξ μ d ξ ν d ξ μ d ξ ν dxi^(mu^('))dxi^(nu^('))d \xi^{\mu^{\prime}} d \xi^{\nu^{\prime}}dξμdξν in this expansion are not the standard Lorentz metric components. The reason is clear. The ξ μ ξ μ xi^(mu^('))\xi^{\mu^{\prime}}ξμ do not form an inertial coordinate system. However, at the position of the observer, ξ 1 = 0 ξ 1 = 0 xi^(1^('))=0\xi^{1^{\prime}}=0ξ1=0, the coefficients reduce to the standard form. Therefore these "local coordinates" approximate a Lorentz coordinate system in the immediate neighborhood of the observer.

EXERCISES

Exercise 6.6. CLOCK RATES VERSUS COORDINATE TIME IN ACCELERATED COORDINATES

Let a clock be attached to each grid point, ( ξ 1 , ξ 2 , ξ 3 ) = ξ 1 , ξ 2 , ξ 3 = (xi^(1),xi^(2),xi^(3^(')))=\left(\xi^{1}, \xi^{2}, \xi^{3^{\prime}}\right)=(ξ1,ξ2,ξ3)= constant, of the local coordinate system of an accelerated observer. Assume for simplicity that the observer is in hyperbolic motion. Use equation (6.18) to show that proper time as measured by a lattice clock differs from coordinate time at its lattice point:
d τ / d ξ 0 = 1 + g ξ 1 d τ / d ξ 0 = 1 + g ξ 1 d tau//dxi^(0^('))=1+gxi^(1^('))d \tau / d \xi^{0^{\prime}}=1+g \xi^{1^{\prime}}dτ/dξ0=1+gξ1
(Of course, very near the observer, at ξ 1 g 1 ξ 1 g 1 xi^(1^('))≪g^(-1)\xi^{1^{\prime}} \ll g^{-1}ξ1g1, the discrepancy is negligible.)

Exercise 6.7. ACCELERATION OF LATTICE POINTS IN ACCELERATED COORDINATES

Let an accelerometer be attached to each grid point of the local coordinates of an observer in hyperbolic motion. Calculate the magnitude of the acceleration measured by the accelerometer at ( ξ 1 , ξ 2 , ξ 3 ) ξ 1 , ξ 2 , ξ 3 (xi^(1^(')),xi^(2^(')),xi^(3^(')))\left(\xi^{1^{\prime}}, \xi^{2^{\prime}}, \xi^{3^{\prime}}\right)(ξ1,ξ2,ξ3).

Exercise 6.8. OBSERVER WITH ROTATING TETRAD

An observer moving along an arbitrarily accelerated world line chooses not to Fermi-Walker transport his orthonormal tetrad. Instead, he allows it to rotate. The antisymmetric rotation tensor Ω Ω Omega\boldsymbol{\Omega}Ω that enters into his transport law
(6.19) d e α / d τ = Ω e α (6.19) d e α / d τ = Ω e α {:(6.19)de_(alpha^('))//d tau=-Omega*e_(alpha^(')):}\begin{equation*} d \boldsymbol{e}_{\alpha^{\prime}} / d \tau=-\boldsymbol{\Omega} \cdot \boldsymbol{e}_{\alpha^{\prime}} \tag{6.19} \end{equation*}(6.19)deα/dτ=Ωeα
splits into a Fermi-Walker part plus a spatial rotation part:
(6.20) Ω μ ν = a μ u ν a ν u μ Ω ( F W ) μ ν + u α ω β ϵ α β μ ν Ω ( S R ) μ v (6.20) Ω μ ν = a μ u ν a ν u μ Ω ( F W ) μ ν + u α ω β ϵ α β μ ν Ω ( S R ) μ v {:(6.20)Omega^(mu nu)=ubrace(a^(mu)u^(nu)-a^(nu)u^(mu)ubrace)_(Omega_((FW))^(mu nu))+ubrace(u_(alpha)omega_(beta)epsilon^(alpha beta mu nu)ubrace)_(Omega_((SR))^(mu v)):}\begin{equation*} \Omega^{\mu \nu}=\underbrace{a^{\mu} u^{\nu}-a^{\nu} u^{\mu}}_{\Omega_{(F W)}^{\mu \nu}}+\underbrace{u_{\alpha} \omega_{\beta} \epsilon^{\alpha \beta \mu \nu}}_{\Omega_{(S R)}^{\mu v}} \tag{6.20} \end{equation*}(6.20)Ωμν=aμuνaνuμΩ(FW)μν+uαωβϵαβμνΩ(SR)μv
ω = ω = omega=\boldsymbol{\omega}=ω= a vector orthogonal to 4 -velocity u u u\boldsymbol{u}u.
(a) The observer chooses his time basis vector to be e 0 = u e 0 = u e_(0^('))=u\boldsymbol{e}_{0^{\prime}}=\boldsymbol{u}e0=u. Show that this choice is permitted by his transport law (6.19), (6.20).
(b) Show that Ω ( S R ) μ ν Ω ( S R ) μ ν Omega_((SR))^(mu nu)\Omega_{(S R)}^{\mu \nu}Ω(SR)μν produces a rotation in the plane perpendicular to u u u\boldsymbol{u}u and ω ω omega\boldsymbol{\omega}ω-i.e., that
(6.21) Ω ( S R ) u = 0 , Ω ( S R ) w = 0 (6.21) Ω ( S R ) u = 0 , Ω ( S R ) w = 0 {:(6.21)Omega_((SR))*u=0","quadOmega_((SR))*w=0:}\begin{equation*} \Omega_{(S R)} \cdot \boldsymbol{u}=0, \quad \Omega_{(S R)} \cdot \boldsymbol{w}=0 \tag{6.21} \end{equation*}(6.21)Ω(SR)u=0,Ω(SR)w=0
(c) Suppose the accelerated observer Fermi-Walker transports a second orthonormal tetrad e α e α e_(alpha^(''))\boldsymbol{e}_{\alpha^{\prime \prime}}eα. Show that the space vectors of his first tetrad rotate relative to those of his second tetrad with angular velocity vector equal to ω ω omega\boldsymbol{\omega}ω. Hint: At a moment when the tetrads coincide, show that (in three-dimensional notation, referring to the 3 -space orthogonal to the observer's world line):
(6.22) d ( e j e j ) / d τ = ω × e j (6.22) d e j e j / d τ = ω × e j {:(6.22)d(e_(j^('))-e_(j^('')))//d tau=omega xxe_(j^(')):}\begin{equation*} d\left(e_{j^{\prime}}-e_{j^{\prime \prime}}\right) / d \tau=\omega \times e_{j^{\prime}} \tag{6.22} \end{equation*}(6.22)d(ejej)/dτ=ω×ej
(d) The observer uses the same prescription [equation (6.16)] to set up local coordinates based on his rotating tetrad as for his Fermi-Walker tetrad. Pick an event Q Q Q\mathscr{Q}Q on the observer's world line, set τ = 0 τ = 0 tau=0\tau=0τ=0 there, and choose the original inertial frame of prescription (6,16) so (1) it comoves with the accelerated observer at Q , ( 2 ) Q , ( 2 ) Q,(2)\mathcal{Q},(2)Q,(2) its origin is at Q Q Q\mathscr{Q}Q, and (3) its axes coincide with the accelerated axes at Q Q Q\mathscr{Q}Q. Show that these conditions translate into
(6.23) z μ ( 0 ) = 0 , e α ( 0 ) = e α (6.23) z μ ( 0 ) = 0 , e α ( 0 ) = e α {:(6.23)z^(mu)(0)=0","quade_(alpha^('))(0)=e_(alpha^(')):}\begin{equation*} z^{\mu}(0)=0, \quad \boldsymbol{e}_{\alpha^{\prime}}(0)=\boldsymbol{e}_{\alpha^{\prime}} \tag{6.23} \end{equation*}(6.23)zμ(0)=0,eα(0)=eα
(e) Show that near Q Q Q\mathscr{Q}Q, equations ( 6.16 ) ( 6.16 ) (6.16)(6.16)(6.16) for the rotating, accelerated coordinates reduce to:
(6.24) x 0 = ξ 0 + a k ξ k ξ 0 + O ( [ ξ α ] 3 ) x j = ξ j + 1 2 a j ξ 0 2 + ϵ j k i ω k ξ ξ 0 + O ( [ ξ α ] 3 ) (6.24) x 0 = ξ 0 + a k ξ k ξ 0 + O ξ α 3 x j = ξ j + 1 2 a j ξ 0 2 + ϵ j k i ω k ξ ξ 0 + O ξ α 3 {:[(6.24)x^(0)=xi^(0^('))+a_(k^('))xi^(k^('))xi^(0^('))+O([xi^(alpha^('))]^(3))],[x^(j)=xi^(j^('))+(1)/(2)a^(j)xi^(0^('2))+epsilon^(jki)omega^(k)xi^(ℓ^('))xi^(0^('))+O([xi^(alpha^('))]^(3))]:}\begin{align*} & x^{0}=\xi^{0^{\prime}}+a_{k^{\prime}} \xi^{k^{\prime}} \xi^{0^{\prime}}+O\left(\left[\xi^{\alpha^{\prime}}\right]^{3}\right) \tag{6.24}\\ & x^{j}=\xi^{j^{\prime}}+\frac{1}{2} a^{j} \xi^{0^{\prime 2}}+\epsilon^{j k i} \omega^{k} \xi^{\ell^{\prime}} \xi^{0^{\prime}}+O\left(\left[\xi^{\alpha^{\prime}}\right]^{3}\right) \end{align*}(6.24)x0=ξ0+akξkξ0+O([ξα]3)xj=ξj+12ajξ02+ϵjkiωkξξ0+O([ξα]3)
(f) A freely moving particle passes through the event Q Q Q\mathcal{Q}Q with ordinary velocity v v v\boldsymbol{v}v as measured in the inertial frame. By transforming its straight world line x j = v j x 0 x j = v j x 0 x^(j)=v^(j)x^(0)x^{j}=v^{j} x^{0}xj=vjx0 to the accelerated, rotating coordinates, show that its coordinate velocity and acceleration there are:
( d ξ j / d ξ 0 ) at 2 = v j ; (6.25) ( d 2 ξ j / d ξ 0 2 2 ) at 2 = a j an 2 ϵ j k i ω k v l Coriolis acceleration + 2 v j a k v k relativisitc correction to inertial acceleration . inertial acceleration d ξ j / d ξ 0 at 2 = v j ; (6.25) d 2 ξ j / d ξ 0 2 2 at 2 = a j an  2 ϵ j k i ω k v l  Coriolis   acceleration  + 2 v j a k v k  relativisitc   correction to   inertial acceleration  .  inertial   acceleration  {:[quad(dxi^(j^('))//dxi^(0))_(at2)=v^(j);],[(6.25)(d^(2)xi^(j)//dxi^(0^(2)2))_(at2)=ubrace(-a^(j)ubrace)_("an ")-ubrace(2epsilon^(jki)omega^(k)v^(l)ubrace)_({:[" Coriolis "],[" acceleration "]:})+ubrace(2v^(j)a^(k)v^(k)ubrace)_({:[" relativisitc "],[" correction to "],[" inertial acceleration "]:}).],[" inertial "],[" acceleration "]:}\begin{align*} & \quad\left(d \xi^{j^{\prime}} / d \xi^{0}\right)_{\mathrm{at} 2}=v^{j} ; \\ & \left(d^{2} \xi^{j} / d \xi^{0^{2} 2}\right)_{\mathrm{at} 2}=\underbrace{-a^{j}}_{\text {an }}-\underbrace{2 \epsilon^{j k i} \omega^{k} v^{l}}_{\begin{array}{l} \text { Coriolis } \\ \text { acceleration } \end{array}}+\underbrace{2 v^{j} a^{k} v^{k}}_{\begin{array}{l} \text { relativisitc } \\ \text { correction to } \\ \text { inertial acceleration } \end{array}} . \tag{6.25}\\ & \text { inertial } \\ & \text { acceleration } \end{align*}(dξj/dξ0)at2=vj;(6.25)(d2ξj/dξ022)at2=ajan 2ϵjkiωkvl Coriolis  acceleration +2vjakvk relativisitc  correction to  inertial acceleration . inertial  acceleration 

Exercise 6.9. THOMAS PRECESSION

Consider a spinning body (gyroscope, electron, ...) that accelerates because forces act at its center of mass. Such forces produce no torque; so they leave the body's intrinsic angularmomentum vector S S S\boldsymbol{S}S unchanged, except for the unique rotation in the u a u a u^^a\boldsymbol{u} \wedge \boldsymbol{a}ua plane required to keep S S S\boldsymbol{S}S orthogonal to the 4 -velocity u u u\boldsymbol{u}u. Mathematically speaking, the body Fermi-Walker transports its angular momentum (no rotation in planes other than u a u a u^^a\boldsymbol{u} \wedge \boldsymbol{a}ua ):
(6.26) d S / d τ = ( u a ) S (6.26) d S / d τ = ( u a ) S {:(6.26)dS//d tau=(u^^a)*S:}\begin{equation*} d \boldsymbol{S} / d \tau=(\boldsymbol{u} \wedge \boldsymbol{a}) \cdot \boldsymbol{S} \tag{6.26} \end{equation*}(6.26)dS/dτ=(ua)S
This transport law applies to a spinning electron that moves in a circular orbit of radius r r rrr around an atomic nucleus. As seen in the laboratory frame, the electron moves in the x x xxx, y y yyy-plane with constant angular velocity, ω ω omega\omegaω. At time t = 0 t = 0 t=0t=0t=0, the electron is at x = r , y = 0 x = r , y = 0 x=r,y=0x=r, y=0x=r,y=0; and its spin (as treated classically) has components
S 0 = 0 , S x = 1 2 , S y = 0 , S z = 1 2 S 0 = 0 , S x = 1 2 , S y = 0 , S z = 1 2 S^(0)=0,quadS^(x)=(1)/(sqrt2)ℏ,quadS^(y)=0,quadS^(z)=(1)/(2)ℏS^{0}=0, \quad S^{x}=\frac{1}{\sqrt{2}} \hbar, \quad S^{y}=0, \quad S^{z}=\frac{1}{2} \hbarS0=0,Sx=12,Sy=0,Sz=12
Calculate the subsequent behavior of the spin as a function of laboratory time, S μ ( t ) S μ ( t ) S^(mu)(t)S^{\mu}(t)Sμ(t). Answer:
S x = 1 2 ( cos ω t cos ω γ t + γ sin ω t sin ω γ t ) (6.27) S y = 1 2 ( sin ω t cos ω γ t γ cos ω t sin ω γ t ) S z = 1 2 ; S 0 = 1 2 v γ sin ω γ t v = ω r ; γ = ( 1 v 2 ) 1 / 2 S x = 1 2 ( cos ω t cos ω γ t + γ sin ω t sin ω γ t ) (6.27) S y = 1 2 ( sin ω t cos ω γ t γ cos ω t sin ω γ t ) S z = 1 2 ; S 0 = 1 2 v γ sin ω γ t v = ω r ; γ = 1 v 2 1 / 2 {:[S^(x)=(1)/(sqrt2)ℏ(cos omega t cos omega gamma t+gamma sin omega t sin omega gamma t)],[(6.27)S^(y)=(1)/(sqrt2)ℏ(sin omega t cos omega gamma t-gamma cos omega t sin omega gamma t)],[S^(z)=(1)/(2)ℏ;quadS^(0)=-(1)/(sqrt2)ℏv gamma sin omega gamma t],[v=omega r;quad gamma=(1-v^(2))^(-1//2)]:}\begin{align*} S^{x} & =\frac{1}{\sqrt{2}} \hbar(\cos \omega t \cos \omega \gamma t+\gamma \sin \omega t \sin \omega \gamma t) \\ S^{y} & =\frac{1}{\sqrt{2}} \hbar(\sin \omega t \cos \omega \gamma t-\gamma \cos \omega t \sin \omega \gamma t) \tag{6.27}\\ S^{z} & =\frac{1}{2} \hbar ; \quad S^{0}=-\frac{1}{\sqrt{2}} \hbar v \gamma \sin \omega \gamma t \\ v & =\omega r ; \quad \gamma=\left(1-v^{2}\right)^{-1 / 2} \end{align*}Sx=12(cosωtcosωγt+γsinωtsinωγt)(6.27)Sy=12(sinωtcosωγtγcosωtsinωγt)Sz=12;S0=12vγsinωγtv=ωr;γ=(1v2)1/2
Rewrite the time-dependent spatial part of this as
(6.28) S x + i S y = 2 [ e i ( γ 1 ) ω t + i ( 1 γ ) sin ( ω γ t ) e i ω t ] (6.28) S x + i S y = 2 e i ( γ 1 ) ω t + i ( 1 γ ) sin ( ω γ t ) e i ω t {:(6.28)S^(x)+iS^(y)=(ℏ)/(sqrt2)[e^(-i(gamma-1)omega t)+i(1-gamma)sin(omega gamma t)e^(i omega t)]:}\begin{equation*} S^{x}+i S^{y}=\frac{\hbar}{\sqrt{2}}\left[e^{-i(\gamma-1) \omega t}+i(1-\gamma) \sin (\omega \gamma t) e^{i \omega t}\right] \tag{6.28} \end{equation*}(6.28)Sx+iSy=2[ei(γ1)ωt+i(1γ)sin(ωγt)eiωt]
The first term rotates steadily in a retrograde direction with angular velocity
ω Thomas = ( γ 1 ) ω (6.29) 1 2 v 2 ω ω if v 1 . ω Thomas  = ( γ 1 ) ω (6.29) 1 2 v 2 ω ω  if  v 1 . {:[omega_("Thomas ")=(gamma-1)omega],[(6.29)~~(1)/(2)v^(2)omega≪omega" if "v≪1.]:}\begin{align*} \omega_{\text {Thomas }} & =(\gamma-1) \omega \\ & \approx \frac{1}{2} v^{2} \omega \ll \omega \text { if } v \ll 1 . \tag{6.29} \end{align*}ωThomas =(γ1)ω(6.29)12v2ωω if v1.
It is called the Thomas precession. The second term rotates in a righthanded manner for part of an orbit ( 0 < ω γ t < π ) ( 0 < ω γ t < π ) (0 < omega gamma t < pi)(0<\omega \gamma t<\pi)(0<ωγt<π) and in a lefthanded manner for the rest ( π < ω γ t < 2 π π < ω γ t < 2 π pi < omega gamma t < 2pi\pi<\omega \gamma t<2 \piπ<ωγt<2π ). Averaged in time, it does nothing. Moreover, in an atom it is very small ( γ 1 1 ) ( γ 1 1 ) (gamma-1≪1)(\gamma-1 \ll 1)(γ11). It must be present, superimposed on the Thomas precession, in order to keep
(6.30) s u = S u S 0 u 0 = 0 (6.30) s u = S u S 0 u 0 = 0 {:(6.30)s*u=S*u-S^(0)u^(0)=0:}\begin{equation*} \boldsymbol{s} \cdot \boldsymbol{u}=\boldsymbol{S} \cdot \boldsymbol{u}-S^{0} u^{0}=0 \tag{6.30} \end{equation*}(6.30)su=SuS0u0=0
and
(6.31) S 2 = S 2 ( S 0 ) 2 = 3 2 / 4 = constant . (6.31) S 2 = S 2 S 0 2 = 3 2 / 4 =  constant  . {:(6.31)S^(2)=S^(2)-(S^(0))^(2)=3ℏ^(2)//4=" constant ".:}\begin{equation*} \boldsymbol{S}^{2}=\boldsymbol{S}^{2}-\left(S^{0}\right)^{2}=3 \hbar^{2} / 4=\text { constant } . \tag{6.31} \end{equation*}(6.31)S2=S2(S0)2=32/4= constant .
It comes into play with righthanded rotation when S u S u S*u\boldsymbol{S} \cdot \boldsymbol{u}Su is negative; it goes out of play when S u = 0 S u = 0 S*u=0\boldsymbol{S} \cdot \boldsymbol{u}=0Su=0; and it returns with lefthanded rotation when S u S u S*u\boldsymbol{S} \cdot \boldsymbol{u}Su turns positive.
The Thomas precession can be understood, alternatively, as a spatial rotation that results from the combination of successive boosts in slightly different directions. [See, e.g., exercise 103 of Taylor and Wheeler (1966).] For an alternative derivation of the Thomas precession (6.29) from "spinor formalism," see §41.4.

CHAPTER

INCOMPATIBILITY OF GRAVITY AND SPECIAL RELATIVITY

§7.1. ATTEMPTS TO INCORPORATE GRAVITY INTO SPECIAL RELATIVITY

The discussion of special relativity so far has consistently assumed an absence of gravitational fields. Why must gravity be ignored in special relativity? This chapter describes the difficulties that gravitational fields cause in the foundations of special relativity. After meeting these difficulties, one can appreciate fully the curved-spacetime methods that Einstein introduced to overcome them.
Start, then, with what one already knows about gravity, Newton's formulation of its laws:
(7.1) d 2 x i / d t 2 = Φ / x i , (7.2) 2 Φ = 4 π G ρ . (7.1) d 2 x i / d t 2 = Φ / x i , (7.2) 2 Φ = 4 π G ρ . {:[(7.1)d^(2)x^(i)//dt^(2)=-del Phi//delx^(i)","],[(7.2)grad^(2)Phi=4pi G rho.]:}\begin{align*} d^{2} x^{i} / d t^{2} & =-\partial \Phi / \partial x^{i}, \tag{7.1}\\ \nabla^{2} \Phi & =4 \pi G \rho . \tag{7.2} \end{align*}(7.1)d2xi/dt2=Φ/xi,(7.2)2Φ=4πGρ.
These equations cannot be incorporated as they stand into special relativity. The equation of motion (7.1) for a particle is in three-dimensional rather than four-dimensional form; it requires modification into a four-dimensional vector equation for d 2 x μ / d τ 2 d 2 x μ / d τ 2 d^(2)x^(mu)//dtau^(2)d^{2} x^{\mu} / d \tau^{2}d2xμ/dτ2. Likewise, the field equation (7.2) is not Lorentz-invariant, since the appearance of a three-dimensional Laplacian operator instead of a four-dimensional d'Alembertian operator means that the potential Φ Φ Phi\PhiΦ responds instantaneously to changes in the density ρ ρ rho\rhoρ at arbitrarily large distances away. In brief, Newtonian gravitational fields propagate with infinite velocity.
One's first reaction to these problems might be to think that they are relatively straightforward to correct. Exercises at the end of this section study some relatively straightforward generalizations of these equations, in which the gravitational potential Φ Φ Phi\PhiΦ is taken to be first a scalar, then a vector, and finally a symmetric tensor field. Each of these theories has significant shortcomings, and all fail to agree with observations. The best of them is the tensor theory (exercise 7.3, Box 7.1), which, however,
This chapter is entirely Track 2.
It depends on no preceding Track-2 material. It is not needed as preparation for any later chapter, but will be helpful in Chapter 18 (weak gravitational fields), and in Chapters 38 and 39 (experimental tests and other theories of gravity).
Newton's gravitational laws must be modified into four-dimensional, geometric form
All straightforward modifications are unsatisfactory
Best modification (tensor theory in flat spacetime) is internally inconsistent; when repaired, it becomes general relativity.
is internally inconsistent and admits no exact solutions. This difficulty has been attacked in recent times by Gupta (1954, 1957, 1962), Kraichnan (1955), Thirring (1961), Feynman (1963), Weinberg (1965), Deser (1970). They show how the flatspace tensor theory may be modified within the spirit of present-day relativistic field theory to overcome these inconsistencies. By this field-theory route (part 5 of Box 17.2), they arrive uniquely at standard 1915 general relativity. Only at this end point does one finally recognize, from the mathematical form of the equations, that what ostensibly started out as a flat-space theory of gravity is really Einstein's theory, with gravitation being a manifestation of the curvature of spacetime. This book follows Einstein's line of reasoning because it keeps the physics to the fore.

EXERCISES

EXERCISES ON FLAT-SPACETIME THEORIES OF GRAVITY

The following three exercises provide a solid challenge. Happily, all three require similar techniques, and a solution to the most difficult one (exercise 7.3) is presented in Box 7.1. Therefore, it is reasonable to proceed as follows. (a) Work either exercise 7.1 (scalar theory of gravity) or 7.2 (vector theory of gravity), skimming exercise 7.3 and Box 7.1 (tensor theory of gravity) for outline and method, not for detail, whenever difficulties arise. (b) Become familiar with the results of the other exercise ( 7.2 or 7.1 ) by discussing it with someone who has worked it in detail. (c) Read in detail the solution to exercise 7.3 as presented in Box 7.1, and compare with the computed results for the other two theories. (d) Develop computational power by checking some detailed computations from Box 7.1.
Exercise 7.1. SCALAR GRAVITATIONAL FIELD, Φ Φ Phi\PhiΦ
A. Consider the variational principle δ I = 0 δ I = 0 delta I=0\delta I=0δI=0, where
(7.3) I = m e ϕ ( η α β d z α d λ d z β d λ ) 1 / 2 d λ (7.3) I = m e ϕ η α β d z α d λ d z β d λ 1 / 2 d λ {:(7.3)I=-m inte^(phi)(-eta_(alpha beta)(dz^(alpha))/(d lambda)(dz^(beta))/(d lambda))^(1//2)d lambda:}\begin{equation*} I=-m \int e^{\phi}\left(-\eta_{\alpha \beta} \frac{d z^{\alpha}}{d \lambda} \frac{d z^{\beta}}{d \lambda}\right)^{1 / 2} d \lambda \tag{7.3} \end{equation*}(7.3)I=meϕ(ηαβdzαdλdzβdλ)1/2dλ
Here m = m = m=m=m= (rest mass) and z α ( λ ) = z α ( λ ) = z^(alpha)(lambda)=z^{\alpha}(\lambda)=zα(λ)= (parametrized world line) for a test particle in the scalar gravitational field Φ Φ Phi\PhiΦ. By varying the particle's world line, derive differential equations governing the particle's motion. Write them using the particle's proper time as the path parameter,
d τ = ( η α β d z α d λ d z β d λ ) 1 / 2 d λ d τ = η α β d z α d λ d z β d λ 1 / 2 d λ d tau=(-eta_(alpha beta)(dz^(alpha))/(d lambda)(dz^(beta))/(d lambda))^(1//2)d lambdad \tau=\left(-\eta_{\alpha \beta} \frac{d z^{\alpha}}{d \lambda} \frac{d z^{\beta}}{d \lambda}\right)^{1 / 2} d \lambdadτ=(ηαβdzαdλdzβdλ)1/2dλ
so that u α = d z α / d τ u α = d z α / d τ u^(alpha)=dz^(alpha)//d tauu^{\alpha}=d z^{\alpha} / d \tauuα=dzα/dτ satisfies u α u β η α β = 1 u α u β η α β = 1 u^(alpha)u^(beta)eta_(alpha beta)=-1u^{\alpha} u^{\beta} \eta_{\alpha \beta}=-1uαuβηαβ=1.
B. Obtain the field equation for Φ ( x ) Φ ( x ) Phi(x)\Phi(\boldsymbol{x})Φ(x) implied by the variational principle δ I = 0 δ I = 0 delta I=0\delta I=0δI=0, where I = E d 4 x I = E d 4 x I=intEd^(4)xI=\int \mathcal{E} d^{4} xI=Ed4x and
(7.4) E = 1 8 π G η α β Φ x α Φ x β m e φ δ 4 [ x z ( τ ) ] d τ (7.4) E = 1 8 π G η α β Φ x α Φ x β m e φ δ 4 [ x z ( τ ) ] d τ {:(7.4)E=-(1)/(8pi G)eta^(alpha beta)(del Phi)/(delx^(alpha))(del Phi)/(delx^(beta))-int me^(varphi)delta^(4)[x-z(tau)]d tau:}\begin{equation*} \mathcal{E}=-\frac{1}{8 \pi G} \eta^{\alpha \beta} \frac{\partial \Phi}{\partial x^{\alpha}} \frac{\partial \Phi}{\partial x^{\beta}}-\int m e^{\varphi} \delta^{4}[\boldsymbol{x}-\boldsymbol{z}(\tau)] d \tau \tag{7.4} \end{equation*}(7.4)E=18πGηαβΦxαΦxβmeφδ4[xz(τ)]dτ
Show that the second term here gives the same integral as that studied in part A (equation 7.3).
Discussion: The field equations obtained describe how a single particle of mass m m mmm generates the scalar field. If many particles are present, one includes in E E E\mathcal{E}E a term m e ϕ δ 4 [ x z ( τ ) ] d τ m e ϕ δ 4 [ x z ( τ ) ] d τ -int me^(phi)delta^(4)[x-z(tau)]d tau-\int m e^{\phi} \delta^{4}[\boldsymbol{x}-\boldsymbol{z}(\tau)] d \taumeϕδ4[xz(τ)]dτ for each particle.
C. Solve the field equation of part B, assuming a single source particle at rest. Also assume that e ϕ = 1 e ϕ = 1 e^(phi)=1e^{\phi}=1eϕ=1 is an adequate approximation in the neighborhood of the particle. Then check this assumption from your solution; i.e., what value does it assign to e ϕ e ϕ e^(phi)e^{\phi}eϕ at the surface of the earth? (Units with c = 1 c = 1 c=1c=1c=1 are used throughout; one may also set G = 1 G = 1 G=1G=1G=1, if one wishes.)
D. Now treat the static, spherically symmetric field Φ Φ Phi\PhiΦ from part C as the field of the sun acting as a given external field in the variational principle of part A , and study the motion of a planet determined by this variational principle. Constants of motion are available from the spherical symmetry and time-independence of the integrand. Use spherical coordinates and assume motion in a plane. Derive a formula for the perihelion precession of a planet.
E. Pass to the limit of a zero rest-mass particle in the equations of motion of part A. Do this by using a parameter λ λ lambda\lambdaλ different from proper time, so chosen that k μ = d x μ / d λ k μ = d x μ / d λ k^(mu)=dx^(mu)//d lambdak^{\mu}=d x^{\mu} / d \lambdakμ=dxμ/dλ is the energy-momentum vector, and by taking the limit m 0 m 0 m longrightarrow0m \longrightarrow 0m0 with k 0 = γ m = E k 0 = γ m = E k^(0)=gamma m=Ek^{0}=\gamma m=Ek0=γm=E remaining finite (so u 0 = γ u 0 = γ u^(0)=gamma longrightarrow oou^{0}=\gamma \longrightarrow \inftyu0=γ ). Use these equations to show that the quantities q μ = k μ e ϕ q μ = k μ e ϕ q^(mu)=k^(mu)e^(phi)q^{\mu}=k^{\mu} e^{\phi}qμ=kμeϕ are constants of motion, and from this deduce that there is no bending of light by the sun in this scalar theory.

Exercise 7.2. VECTOR GRAVITATIONAL FIELD, Φ μ Φ μ Phi_(mu)\Phi_{\mu}Φμ

A. Verify that the variational principle δ I = 0 δ I = 0 delta I=0\delta I=0δI=0 gives Maxwell's equations by varying A μ A μ A_(mu)A_{\mu}Aμ, and the Lorentz force law by varying z μ ( τ ) z μ ( τ ) z^(mu)(tau)z^{\mu}(\tau)zμ(τ), when
(7.5) I = 1 16 π F μ ν F μ ν d 4 x + 1 2 m d z μ d τ d z μ d τ d τ + e d z μ d τ A μ ( z ) d τ (7.5) I = 1 16 π F μ ν F μ ν d 4 x + 1 2 m d z μ d τ d z μ d τ d τ + e d z μ d τ A μ ( z ) d τ {:(7.5)I=(-1)/(16 pi)intF_(mu nu)F^(mu nu)d^(4)x+(1)/(2)m int(dz^(mu))/(d tau)(dz_(mu))/(d tau)d tau+e int(dz^(mu))/(d tau)A_(mu)(z)d tau:}\begin{equation*} I=\frac{-1}{16 \pi} \int F_{\mu \nu} F^{\mu \nu} d^{4} x+\frac{1}{2} m \int \frac{d z^{\mu}}{d \tau} \frac{d z_{\mu}}{d \tau} d \tau+e \int \frac{d z^{\mu}}{d \tau} A_{\mu}(z) d \tau \tag{7.5} \end{equation*}(7.5)I=116πFμνFμνd4x+12mdzμdτdzμdτdτ+edzμdτAμ(z)dτ
Here F μ ν F μ ν F_(mu nu)F_{\mu \nu}Fμν is an abbreviation for A v , μ A μ , ν A v , μ A μ , ν A_(v,mu)-A_(mu,nu)A_{v, \mu}-A_{\mu, \nu}Av,μAμ,ν. Hint: to vary A μ ( x ) A μ ( x ) A_(mu)(x)A_{\mu}(\boldsymbol{x})Aμ(x), rewrite the last term as a spacetime integral by introducing a delta function δ 4 [ x z ( τ ) ] δ 4 [ x z ( τ ) ] delta^(4)[x-z(tau)]\delta^{4}[\boldsymbol{x}-\boldsymbol{z}(\tau)]δ4[xz(τ)] as in exercise 7.1, parts A and B.
B. Define, by analogy to the above, a vector gravitational field Φ μ Φ μ Phi_(mu)\Phi_{\mu}Φμ with G μ ν Φ ν , μ Φ μ , ν G μ ν Φ ν , μ Φ μ , ν G_(mu nu)-=Phi_(nu,mu)-Phi_(mu,nu)G_{\mu \nu} \equiv \Phi_{\nu, \mu}-\Phi_{\mu, \nu}GμνΦν,μΦμ,ν using a variational principle with
(7.6) I = + 1 16 π G G μ ν G μ v d 4 x + 1 2 m d z μ d τ d z μ d τ d τ + m Φ μ d z μ d τ d τ (7.6) I = + 1 16 π G G μ ν G μ v d 4 x + 1 2 m d z μ d τ d z μ d τ d τ + m Φ μ d z μ d τ d τ {:(7.6)I=+(1)/(16 pi G)intG_(mu nu)G^(mu v)d^(4)x+(1)/(2)m int(dz^(mu))/(d tau)(dz_(mu))/(d tau)d tau+m intPhi_(mu)(dz^(mu))/(d tau)d tau:}\begin{equation*} I=+\frac{1}{16 \pi G} \int G_{\mu \nu} G^{\mu v} d^{4} x+\frac{1}{2} m \int \frac{d z^{\mu}}{d \tau} \frac{d z_{\mu}}{d \tau} d \tau+m \int \Phi_{\mu} \frac{d z^{\mu}}{d \tau} d \tau \tag{7.6} \end{equation*}(7.6)I=+116πGGμνGμvd4x+12mdzμdτdzμdτdτ+mΦμdzμdτdτ
(Note: if many particles are present, one must augment I I III by terms 1 2 m ( d z μ / d τ ) ( d z μ / d τ ) d τ + 1 2 m d z μ / d τ d z μ / d τ d τ + (1)/(2)m int(dz^(mu)//d tau)(dz_(mu)//d tau)d tau+\frac{1}{2} m \int\left(d z^{\mu} / d \tau\right)\left(d z_{\mu} / d \tau\right) d \tau+12m(dzμ/dτ)(dzμ/dτ)dτ+ m Φ μ ( d z μ / d τ ) d τ m Φ μ d z μ / d τ d τ m intPhi_(mu)(dz^(mu)//d tau)d taum \int \Phi_{\mu}\left(d z^{\mu} / d \tau\right) d \taumΦμ(dzμ/dτ)dτ for each particle.) Find the "Coulomb" law in this theory, and verify that the coefficients of the terms in the variational principle have been chosen reasonably.
C. Compute the perihelion precession in this theory.
D. Compute the bending of light in this theory (i.e., scattering of a highly relativistic particle u 0 = γ u 0 = γ u^(0)=gamma longrightarrow oou^{0}=\gamma \longrightarrow \inftyu0=γ ), as it passes by the sun, because of the sun's Φ μ Φ μ Phi_(mu)\Phi_{\mu}Φμ field.
E. Obtain a formula for the total field energy corresponding to the Lagrangian implicit in part B. Use the standard method of Hamiltonian mechanics, with
I field = 1 16 π G G μ ν G μ v d 4 x E d 4 x I field  = 1 16 π G G μ ν G μ v d 4 x E d 4 x I_("field ")=(1)/(16 pi G)intG_(mu nu)G^(mu v)d^(4)x-=intEd^(4)xI_{\text {field }}=\frac{1}{16 \pi G} \int G_{\mu \nu} G^{\mu v} d^{4} x \equiv \int \mathscr{E} d^{4} xIfield =116πGGμνGμvd4xEd4x
L L L\mathcal{L}L is the Lagrangian density and L E d 3 x L E d 3 x L-=intEd^(3)xL \equiv \int \mathcal{E} d^{3} xLEd3x is the Lagrangian. The corresponding Hamiltonian density ( -=\equiv energy density) is
K = μ Φ μ , 0 L Φ μ , 0 E K = μ Φ μ , 0 L Φ μ , 0 E K=sum_(mu)Phi_(mu,0)(delL)/(delPhi_(mu,0))-E\mathscr{K}=\sum_{\mu} \Phi_{\mu, 0} \frac{\partial \mathcal{L}}{\partial \Phi_{\mu, 0}}-\mathcal{E}K=μΦμ,0LΦμ,0E
Show that vector gravitational waves carry negative energy.
Exercise 7.3. SYMMETRIC TENSOR GRAVITATIONAL FIELD, h μ ν = h ν μ h μ ν = h ν μ h_(mu nu)=h_(nu mu)h_{\mu \nu}=h_{\nu \mu}hμν=hνμ
Here the action principle is, as for the vector field, δ I = 0 δ I = 0 delta I=0\delta I=0δI=0, with I = I field + I particle + I = I field  + I particle  + I=I_("field ")+I_("particle ")+I=I_{\text {field }}+I_{\text {particle }}+I=Ifield +Iparticle + I interaction . I particle I interaction  . I particle  I_("interaction ").I_("particle ")I_{\text {interaction }} . I_{\text {particle }}Iinteraction .Iparticle  is the same as for the vector field:
(7.7) I particle = 1 2 m d z μ d τ d z μ d τ d τ (7.7) I particle  = 1 2 m d z μ d τ d z μ d τ d τ {:(7.7)I_("particle ")=(1)/(2)m int(dz^(mu))/(d tau)(dz_(mu))/(d tau)d tau:}\begin{equation*} I_{\text {particle }}=\frac{1}{2} m \int \frac{d z^{\mu}}{d \tau} \frac{d z_{\mu}}{d \tau} d \tau \tag{7.7} \end{equation*}(7.7)Iparticle =12mdzμdτdzμdτdτ
However, I field I field  I_("field ")I_{\text {field }}Ifield  and I interaction I interaction  I_("interaction ")I_{\text {interaction }}Iinteraction  are different:
(7.8b) I field = E f d 4 x , E f = 1 32 π G ( 1 2 h ν β , α h ¯ ν β , α h ¯ μ α , α h μ β , β ) [ Note that one h here is not an h ¯ ] , $ (7.8b) I field  = E f d 4 x , E f = 1 32 π G 1 2 h ν β , α h ¯ ν β , α h ¯ μ α , α h μ β , β  Note that   one  h  here   is not an  h ¯ , $ {:(7.8b){:[I_("field "),=intE_(f)d^(4)x","],[E_(f),=(-1)/(32 pi G)((1)/(2)h_(nu beta,alpha) bar(h)^(nu beta,alpha)- bar(h)_(mu alpha),alpha:}],[h^(mu beta)],[","beta])quad[[" Note that "],[" one "h" here "],[" is not an " bar(h)]]","$:}\left.\begin{array}{rl} I_{\text {field }} & =\int \mathcal{E}_{f} d^{4} x, \\ \mathcal{E}_{f} & =\frac{-1}{32 \pi G}\left(\frac{1}{2} h_{\nu \beta, \alpha} \bar{h}^{\nu \beta, \alpha}-\bar{h}_{\mu \alpha}, \alpha\right. \tag{7.8b}\\ h^{\mu \beta} \\ , \beta \end{array}\right) \quad\left[\begin{array}{l} \text { Note that } \\ \text { one } h \text { here } \\ \text { is not an } \bar{h} \end{array}\right], ~ \$(7.8b)Ifield =Efd4x,Ef=132πG(12hνβ,αh¯νβ,αh¯μα,αhμβ,β)[ Note that  one h here  is not an h¯], $
with
(7.8c) h ¯ μ ν h μ ν 1 2 η μ ν h σ σ (7.9) I interaction = 1 2 h μ ν T μ ν d 4 x . (7.8c) h ¯ μ ν h μ ν 1 2 η μ ν h σ σ (7.9) I interaction  = 1 2 h μ ν T μ ν d 4 x . {:[(7.8c) bar(h)_(mu nu)-=h_(mu nu)-(1)/(2)eta_(mu nu)h_(sigma)^(sigma)],[(7.9)I_("interaction ")=(1)/(2)inth_(mu nu)T^(mu nu)d^(4)x.]:}\begin{align*} \bar{h}_{\mu \nu} & \equiv h_{\mu \nu}-\frac{1}{2} \eta_{\mu \nu} h_{\sigma}^{\sigma} \tag{7.8c}\\ I_{\text {interaction }} & =\frac{1}{2} \int h_{\mu \nu} T^{\mu \nu} d^{4} x . \tag{7.9} \end{align*}(7.8c)h¯μνhμν12ημνhσσ(7.9)Iinteraction =12hμνTμνd4x.
Here T μ ν T μ ν T^(mu nu)T^{\mu \nu}Tμν is the stress-energy tensor for all nongravitational fields and matter present. For a system of point particles (used throughout this exercise),
(7.10) T μ ν ( x ) = m d z μ d τ d z ν d τ δ 4 [ x z ( τ ) ] d τ (7.10) T μ ν ( x ) = m d z μ d τ d z ν d τ δ 4 [ x z ( τ ) ] d τ {:(7.10)T^(mu nu)(x)=int m(dz^(mu))/(d tau)(dz^(nu))/(d tau)delta^(4)[x-z(tau)]d tau:}\begin{equation*} T^{\mu \nu}(\boldsymbol{x})=\int m \frac{d z^{\mu}}{d \tau} \frac{d z^{\nu}}{d \tau} \delta^{4}[\boldsymbol{x}-\boldsymbol{z}(\tau)] d \tau \tag{7.10} \end{equation*}(7.10)Tμν(x)=mdzμdτdzνdτδ4[xz(τ)]dτ
A. Obtain the equations of motion of a particle by varying z μ ( τ ) z μ ( τ ) z^(mu)(tau)z^{\mu}(\tau)zμ(τ) in δ ( I particle + δ I particle  + delta(I_("particle ")+:}\delta\left(I_{\text {particle }}+\right.δ(Iparticle + I interaction ) = 0 I interaction  = 0 {:I_("interaction "))=0\left.I_{\text {interaction }}\right)=0Iinteraction )=0. Express your result in terms of the "gravitational force field"
(7.11) Γ ν α β = 1 2 ( h ν α , β + h ν β , α h α β , ν ) (7.11) Γ ν α β = 1 2 h ν α , β + h ν β , α h α β , ν {:(7.11)Gamma_(nu alpha beta)=(1)/(2)(h_(nu alpha,beta)+h_(nu beta,alpha)-h_(alpha beta,nu)):}\begin{equation*} \Gamma_{\nu \alpha \beta}=\frac{1}{2}\left(h_{\nu \alpha, \beta}+h_{\nu \beta, \alpha}-h_{\alpha \beta, \nu}\right) \tag{7.11} \end{equation*}(7.11)Γναβ=12(hνα,β+hνβ,αhαβ,ν)
derived from the tensor gravitational potentials h μ ν = h ν μ h μ ν = h ν μ h_(mu nu)=h_(nu mu)h_{\mu \nu}=h_{\nu \mu}hμν=hνμ.
B. Obtain the field equations from δ ( I field + I interaction ) = 0 δ I field  + I interaction  = 0 delta(I_("field ")+I_("interaction "))=0\delta\left(I_{\text {field }}+I_{\text {interaction }}\right)=0δ(Ifield +Iinteraction )=0; express them in terms of
(7.12) H μ α ν β h ¯ μ ν η α β + h ¯ α β η μ ν h ¯ α ν η μ β h ¯ μ β η α ν (7.12) H μ α ν β h ¯ μ ν η α β + h ¯ α β η μ ν h ¯ α ν η μ β h ¯ μ β η α ν {:(7.12)-H^(mu alpha nu beta)-= bar(h)^(mu nu)eta^(alpha beta)+ bar(h)^(alpha beta)eta^(mu nu)- bar(h)^(alpha nu)eta^(mu beta)- bar(h)^(mu beta)eta^(alpha nu):}\begin{equation*} -H^{\mu \alpha \nu \beta} \equiv \bar{h}^{\mu \nu} \eta^{\alpha \beta}+\bar{h}^{\alpha \beta} \eta^{\mu \nu}-\bar{h}^{\alpha \nu} \eta^{\mu \beta}-\bar{h}^{\mu \beta} \eta^{\alpha \nu} \tag{7.12} \end{equation*}(7.12)Hμανβh¯μνηαβ+h¯αβημνh¯ανημβh¯μβηαν
Discuss gauge invariance, and the condition h ¯ μ α , α = 0 h ¯ μ α , α = 0 bar(h)^(mu alpha)_(,alpha)=0\bar{h}^{\mu \alpha}{ }_{, \alpha}=0h¯μα,α=0.
C. Find the tensor gravitational potentials h μ ν , α h μ ν , α h_(mu nu)^(,alpha)h_{\mu \nu}^{, \alpha}hμν,α due to the sun (treated as a point mass).
D. Compute the perihelion precession.
E. Compute the bending of light.
F. Consider a gravitational wave
(7.13) h ¯ μ ν = A μ ν exp ( i k α x α ) (7.13) h ¯ μ ν = A μ ν exp i k α x α {:(7.13) bar(h)^(mu nu)=A^(mu nu)exp(ik_(alpha)x^(alpha)):}\begin{equation*} \bar{h}^{\mu \nu}=A^{\mu \nu} \exp \left(i k_{\alpha} x^{\alpha}\right) \tag{7.13} \end{equation*}(7.13)h¯μν=Aμνexp(ikαxα)
What conditions are imposed by the field equations? By the gauge condition
(7.14) h ¯ μ α , α = 0 ? (7.14) h ¯ μ α , α = 0 ? {:(7.14) bar(h)^(mu alpha)_(,alpha)=0?:}\begin{equation*} \bar{h}^{\mu \alpha}{ }_{, \alpha}=0 ? \tag{7.14} \end{equation*}(7.14)h¯μα,α=0?
Show that, by further gauge transformations
(7.15) h μ ν h μ ν + ξ μ , ν + ξ v , μ (7.15) h μ ν h μ ν + ξ μ , ν + ξ v , μ {:(7.15)h_(mu nu)longrightarrowh_(mu nu)+xi_(mu,nu)+xi_(v,mu):}\begin{equation*} h_{\mu \nu} \longrightarrow h_{\mu \nu}+\xi_{\mu, \nu}+\xi_{v, \mu} \tag{7.15} \end{equation*}(7.15)hμνhμν+ξμ,ν+ξv,μ
that preserve the h ¯ μ α , α = 0 h ¯ μ α , α = 0 bar(h)^(mu alpha)_(,alpha)=0\bar{h}^{\mu \alpha}{ }_{, \alpha}=0h¯μα,α=0 restrictions, further conditions
(7.16) u α h ¯ α μ = 0 , h ¯ α α = 0 (7.16) u α h ¯ α μ = 0 , h ¯ α α = 0 {:(7.16)u_(alpha) bar(h)^(alpha mu)=0","quad bar(h)_(alpha)^(alpha)=0:}\begin{equation*} u_{\alpha} \bar{h}^{\alpha \mu}=0, \quad \bar{h}_{\alpha}^{\alpha}=0 \tag{7.16} \end{equation*}(7.16)uαh¯αμ=0,h¯αα=0
can be imposed, where u α u α u^(alpha)u^{\alpha}uα is a fixed, timelike vector. It is sufficient to consider the case, obtained by a suitable choice of reference frame, where u α = ( 1 ; 0 , 0 , 0 ) u α = ( 1 ; 0 , 0 , 0 ) u^(alpha)=(1;0,0,0)u^{\alpha}=(1 ; 0,0,0)uα=(1;0,0,0) and k α = ( ω ; 0 , 0 , ω ) k α = ( ω ; 0 , 0 , ω ) k^(alpha)=(omega;0,0,omega)k^{\alpha}=(\omega ; 0,0, \omega)kα=(ω;0,0,ω).
G. From the Hamiltonian density
(7.17) H h ˙ μ ν ( L / h ˙ μ ν ) L (7.17) H h ˙ μ ν L / h ˙ μ ν L {:(7.17)H-=h^(˙)_(mu nu)(delL//delh^(˙)_(mu nu))-L:}\begin{equation*} \mathscr{H} \equiv \dot{h}_{\mu \nu}\left(\partial \mathcal{L} / \partial \dot{h}_{\mu \nu}\right)-\mathcal{L} \tag{7.17} \end{equation*}(7.17)Hh˙μν(L/h˙μν)L
for the field, show that the energy density of the waves considered in part F is positive.
H. Compute T μ ν , T μ ν , T^(mu nu),^(')T^{\mu \nu},{ }^{\prime}Tμν,, for the stress-energy tensor of particles T μ ν T μ ν T^(mu nu)T^{\mu \nu}Tμν that appears in the action integral I I III. Does T μ ν , v T μ ν , v T^(mu nu),vT^{\mu \nu}, vTμν,v vanish (e.g., for the earth in orbit around the sun)? Why? Show that the coupled equations for fields and particles obtained from δ I = 0 δ I = 0 delta I=0\delta I=0δI=0 have no solutions.

Box 7.1 AN ATTEMPT TO DESCRIBE GRAVITY BY A SYMMETRIC TENSOR FIELD IN FLAT SPACETIME [Solution to exercise 7.3]

Attempts to describe gravity within the framework of special relativity would naturally begin by considering the gravitational field to be a scalar (exercise 7.1) as it is in Newtonian theory, or a vector (exercise 7.2) by analogy to electromagnetism. Only after these are found to be deficient (e.g., no bending of light in either theory; negative-energy waves in the vector theory) would one face the computational complexities of a symmetric tensor gravitational potential, h μ ν = h ν μ h μ ν = h ν μ h_(mu nu)=h_(nu mu)h_{\mu \nu}=h_{\nu \mu}hμν=hνμ, which has more indices.
The foundations of the most satisfactory of all tensor theories of gravity in flat spacetime are laid out at the beginning of exercise 7.3. The choice of the Lagrangian made there (equations 7.8) is dictated by the demand that h μ ν h μ ν h_(mu nu)h_{\mu \nu}hμν be a "Lorentz covariant, massless, spin-two field." The meaning of this demand, and the techniques of special relativity required to translate it into a set of field equations, are customarily found in books on elementary particle physics or quantum field theory; see, e.g., Wentzel (1949), Feynman (1963), or Gasiorowicz (1966). Fierz and Pauli (1939) were the first to write down this Lagrangian and investigate the resulting theory. The conclusions of the theory are spelled out here in the form of a solution to exercise 7.3.

A. Equation of Motion for a Test Particle (exercise 7.3A)

Carry out the integration in equation (7.9), using the particle stress-energy tensor of equation (7.10), to find
(1) I p + i I particle + I interaction = 1 2 m ( η μ ν + h μ ν ) z ˙ μ z ˙ ν d τ (1) I p + i I particle  + I interaction  = 1 2 m η μ ν + h μ ν z ˙ μ z ˙ ν d τ {:(1)I_(p+i)-=I_("particle ")+I_("interaction ")=(1)/(2)m int(eta_(mu nu)+h_(mu nu))z^(˙)^(mu)z^(˙)^(nu)d tau:}\begin{equation*} I_{p+i} \equiv I_{\text {particle }}+I_{\text {interaction }}=\frac{1}{2} m \int\left(\eta_{\mu \nu}+h_{\mu \nu}\right) \dot{z}^{\mu} \dot{z}^{\nu} d \tau \tag{1} \end{equation*}(1)Ip+iIparticle +Iinteraction =12m(ημν+hμν)z˙μz˙νdτ
where
z ˙ μ d z μ / d τ z ˙ μ d z μ / d τ z^(˙)^(mu)-=dz^(mu)//d tau\dot{z}^{\mu} \equiv d z^{\mu} / d \tauz˙μdzμ/dτ
Then compute δ I p + i δ I p + i deltaI_(p+i)\delta I_{p+i}δIp+i, and find that the coefficient of the arbitrary variation in path δ z μ δ z μ deltaz^(mu)\delta z^{\mu}δzμ vanishes if and only if
( d / d τ ) [ ( η μ ν + h μ ν ) z ˙ ν ] 1 2 h α β , μ z ˙ α z ˙ β = 0 ( d / d τ ) η μ ν + h μ ν z ˙ ν 1 2 h α β , μ z ˙ α z ˙ β = 0 (d//d tau)[(eta_(mu nu)+h_(mu nu))z^(˙)^(nu)]-(1)/(2)h_(alpha beta,mu)z^(˙)^(alpha)z^(˙)^(beta)=0(d / d \tau)\left[\left(\eta_{\mu \nu}+h_{\mu \nu}\right) \dot{z}^{\nu}\right]-\frac{1}{2} h_{\alpha \beta, \mu} \dot{z}^{\alpha} \dot{z}^{\beta}=0(d/dτ)[(ημν+hμν)z˙ν]12hαβ,μz˙αz˙β=0
Rewrite this equation of motion in the form
(2) ( η μ ν + h μ ν ) z ¨ ν + Γ μ α β z ˙ α z ˙ β = 0 (2) η μ ν + h μ ν z ¨ ν + Γ μ α β z ˙ α z ˙ β = 0 {:(2)(eta_(mu nu)+h_(mu nu))z^(¨)^(nu)+Gamma_(mu alpha beta)z^(˙)^(alpha)z^(˙)^(beta)=0:}\begin{equation*} \left(\eta_{\mu \nu}+h_{\mu \nu}\right) \ddot{z}^{\nu}+\Gamma_{\mu \alpha \beta} \dot{z}^{\alpha} \dot{z}^{\beta}=0 \tag{2} \end{equation*}(2)(ημν+hμν)z¨ν+Γμαβz˙αz˙β=0
where Γ μ α β Γ μ α β Gamma_(mu alpha beta)\Gamma_{\mu \alpha \beta}Γμαβ is defined in equation (7.11).

B 1 B 1 B_(1)B_{1}B1. Field Equations (exercise 7.3B)

Use I field I field  I_("field ")I_{\text {field }}Ifield  and I interaction I interaction  I_("interaction ")I_{\text {interaction }}Iinteraction  in the forms given in equations (7.8) and (7.9); but for the quickest and least messy derivation, do not use the standard Euler-Lagrange equations. Instead, compute directly the first-order change δ L f δ L f deltaL_(f)\delta \mathcal{L}_{f}δLf produced by a small

Box 7.1 (continued)

variation δ h α β δ h α β deltah_(alpha beta)\delta h_{\alpha \beta}δhαβ of the field. For the second term of E ρ E ρ E_(rho)\mathcal{E}_{\rho}Eρ, it is clear (by relabeling dummy indices as needed) that varying each factor gives the same result, so the two terms from the product rule combine:
δ ( h ¯ μ α h ¯ μ β , β ) = 2 h ¯ μ β , β δ h ¯ μ α , α . δ h ¯ μ α h ¯ μ β , β = 2 h ¯ μ β , β δ h ¯ μ α , α . delta( bar(h)_(mualpha^(')) bar(h)^(mu beta)_(,beta))=2 bar(h)^(mu beta)_(,beta)delta bar(h)_(mualpha^('),alpha).\delta\left(\bar{h}_{\mu \alpha^{\prime}} \bar{h}^{\mu \beta}{ }_{, \beta}\right)=2 \bar{h}^{\mu \beta}{ }_{, \beta} \delta \bar{h}_{\mu \alpha^{\prime}, \alpha} .δ(h¯μαh¯μβ,β)=2h¯μβ,βδh¯μα,α.
A similar result holds for the first term of E f E f E_(f)\mathcal{E}_{f}Ef, in view of the identity a μ ν b ¯ μ ν = a ¯ μ ν b μ ν a μ ν b ¯ μ ν = a ¯ μ ν b μ ν a_(mu nu) bar(b)^(mu nu)= bar(a)_(mu nu)b^(mu nu)a_{\mu \nu} \bar{b}^{\mu \nu}=\bar{a}_{\mu \nu} b^{\mu \nu}aμνb¯μν=a¯μνbμν, which holds for the "bar" operation of equations (7.8); each side here is just a μ ν b μ ν a μ ν b μ ν a_(mu nu)b^(mu nu)a_{\mu \nu} b^{\mu \nu}aμνbμν 1 2 a μ μ b ν p 1 2 a μ μ b ν p -(1)/(2)a^(mu)_(mu)b^(nu)_(p)-\frac{1}{2} a^{\mu}{ }_{\mu} b^{\nu}{ }_{p}12aμμbνp. Consequently,
(3) ( 32 π G ) δ Q f = h ¯ ν β , α δ h ν β , α 2 h ¯ μ β , β δ h ¯ μ α α . (3) ( 32 π G ) δ Q f = h ¯ ν β , α δ h ν β , α 2 h ¯ μ β , β δ h ¯ μ α α . {:(3)-(32 pi G)deltaQ_(f)= bar(h)^(nu beta,alpha)deltah_(nu beta,alpha)-2 bar(h)^(mu beta)_(,beta)delta bar(h)_(mualpha^('))^(alpha).:}\begin{equation*} -(32 \pi G) \delta Q_{f}=\bar{h}^{\nu \beta, \alpha} \delta h_{\nu \beta, \alpha}-2 \bar{h}^{\mu \beta}{ }_{, \beta} \delta \bar{h}_{\mu \alpha^{\prime}}{ }^{\alpha} . \tag{3} \end{equation*}(3)(32πG)δQf=h¯νβ,αδhνβ,α2h¯μβ,βδh¯μαα.
Next use this expression in δ I field δ I field  deltaI_("field ")\delta I_{\text {field }}δIfield ; and, by an integration by parts, remove the derivatives from δ h μ ν δ h μ ν deltah_(mu nu)\delta h_{\mu \nu}δhμν, giving
δ I field = ( 32 π G ) 1 [ h ¯ ν β , α , α δ h ν β 2 h ¯ , β μ β , , α δ h ¯ μ α ] d 4 x δ I field  = ( 32 π G ) 1 h ¯ ν β , α , α δ h ν β 2 h ¯ , β μ β , , α δ h ¯ μ α d 4 x deltaI_("field ")=(32 pi G)^(-1)int[ bar(h)^(nu beta,alpha)_(,alpha)deltah_(nu beta)-2 bar(h)_(,beta)^(mu beta),_(,alpha)delta bar(h)_(mu alpha)]d^(4)x\delta I_{\text {field }}=(32 \pi G)^{-1} \int\left[\bar{h}^{\nu \beta, \alpha}{ }_{, \alpha} \delta h_{\nu \beta}-2 \bar{h}_{, \beta}^{\mu \beta},_{, \alpha} \delta \bar{h}_{\mu \alpha}\right] d^{4} xδIfield =(32πG)1[h¯νβ,α,αδhνβ2h¯,βμβ,,αδh¯μα]d4x
To find the coefficient of δ h μ ν δ h μ ν deltah_(mu nu)\delta h_{\mu \nu}δhμν in this expression, write (from equation 7.8c)
δ h ¯ α β = ( δ α μ δ ν β 1 2 η α β η μ ν ) δ h μ ν ; δ h ¯ α β = δ α μ δ ν β 1 2 η α β η μ ν δ h μ ν ; delta bar(h)_(alpha beta)=(delta_(alpha)^(mu)delta_(nu beta)-(1)/(2)eta_(alpha beta)eta^(mu nu))deltah_(mu nu);\delta \bar{h}_{\alpha \beta}=\left(\delta_{\alpha}^{\mu} \delta_{\nu \beta}-\frac{1}{2} \eta_{\alpha \beta} \eta^{\mu \nu}\right) \delta h_{\mu \nu} ;δh¯αβ=(δαμδνβ12ηαβημν)δhμν;
and then rearrange and relabel dummy (summation) indices to obtain
δ I field = ( 32 π G ) 1 [ h ¯ μ β , α , α δ h ν β 2 h ¯ μ β , β , α δ h ¯ μ α ] d 4 x δ I field  = ( 32 π G ) 1 h ¯ μ β , α , α δ h ν β 2 h ¯ μ β , β , α δ h ¯ μ α d 4 x deltaI_("field ")=(32 pi G)^(-1)int[ bar(h)^(mu beta,alpha)_(,alpha)deltah_(nu beta)-2 bar(h)^(mu beta)_(,beta)^(,alpha)delta bar(h)_(mu alpha)]d^(4)x\delta I_{\text {field }}=(32 \pi G)^{-1} \int\left[\bar{h}^{\mu \beta, \alpha}{ }_{, \alpha} \delta h_{\nu \beta}-2 \bar{h}^{\mu \beta}{ }_{, \beta}^{, \alpha} \delta \bar{h}_{\mu \alpha}\right] d^{4} xδIfield =(32πG)1[h¯μβ,α,αδhνβ2h¯μβ,β,αδh¯μα]d4x
By combining this with δ I interaction = 1 2 T μ ν δ h μ ν d 4 x δ I interaction  = 1 2 T μ ν δ h μ ν d 4 x deltaI_("interaction ")=(1)/(2)T^(mu nu)deltah_(mu nu)d^(4)x\delta I_{\text {interaction }}=\frac{1}{2} T^{\mu \nu} \delta h_{\mu \nu} d^{4} xδIinteraction =12Tμνδhμνd4x, and by using the symmetry δ h μ ν = δ h ν μ δ h μ ν = δ h ν μ deltah_(mu nu)=deltah_(nu mu)\delta h_{\mu \nu}=\delta h_{\nu \mu}δhμν=δhνμ, obtain
(4) h ¯ μ ν , α α η μ ν h ¯ α β , α β + h ¯ μ α , α , ν + h ¯ ν α , α , μ = 16 π G T μ ν . (4) h ¯ μ ν , α α η μ ν h ¯ α β , α β + h ¯ μ α , α , ν + h ¯ ν α , α , μ = 16 π G T μ ν . {:(4)- bar(h)^(mu nu)_(,alpha^(alpha))-eta^(mu nu) bar(h)^(alpha beta)_(,alpha beta)+ bar(h)^(mu alpha)_(,alpha^(,nu))+ bar(h)^(nu alpha)_(,alpha^(,mu))=16 pi GT^(mu nu).:}\begin{equation*} -\bar{h}^{\mu \nu}{ }_{, \alpha^{\alpha}}-\eta^{\mu \nu} \bar{h}^{\alpha \beta}{ }_{, \alpha \beta}+\bar{h}^{\mu \alpha}{ }_{, \alpha^{, \nu}}+\bar{h}^{\nu \alpha}{ }_{, \alpha^{, \mu}}=16 \pi G T^{\mu \nu} . \tag{4} \end{equation*}(4)h¯μν,ααημνh¯αβ,αβ+h¯μα,α,ν+h¯να,α,μ=16πGTμν.
The definition made in equation (7.12) allows this to be rewritten as
( ) H μ α ν β , α β = 16 π G T μ ν . ( ) H μ α ν β , α β = 16 π G T μ ν . {:('")"H^(mu alpha nu beta)_(,alpha beta)=16 pi GT^(mu nu).:}\begin{equation*} H^{\mu \alpha \nu \beta}{ }_{, \alpha \beta}=16 \pi G T^{\mu \nu} . \tag{$\prime$} \end{equation*}()Hμανβ,αβ=16πGTμν.

B 2 B 2 B_(2)B_{2}B2. Gauge Invariance (exercise 7.3B, continued)

The symmetries,
H μ α ν β = H [ μ α ] [ ν β ] = H ν β μ α , H μ α ν β = H [ μ α ] [ ν β ] = H ν β μ α , H^(mu alpha nu beta)=H^([mu alpha][nu beta])=H^(nu beta mu alpha),H^{\mu \alpha \nu \beta}=H^{[\mu \alpha][\nu \beta]}=H^{\nu \beta \mu \alpha},Hμανβ=H[μα][νβ]=Hνβμα,
of H μ α ν β H μ α ν β H^(mu alpha nu beta)H^{\mu \alpha \nu \beta}Hμανβ imply an identity
H μ α ν β , α β ν = H μ α [ ν β 1 , α ( β ν ) 0 H μ α ν β , α β ν = H μ α [ ν β 1 , α ( β ν ) 0 H^(mu alpha nu beta)_(,alpha beta nu)=H^(mu alpha[nu beta1)_(,alpha(beta nu))-=0H^{\mu \alpha \nu \beta}{ }_{, \alpha \beta \nu}=H^{\mu \alpha[\nu \beta 1}{ }_{, \alpha(\beta \nu)} \equiv 0Hμανβ,αβν=Hμα[νβ1,α(βν)0
analogous to F μ ν , v μ 0 F μ ν , v μ 0 F^(mu nu)_(,v mu)-=0F^{\mu \nu}{ }_{, v \mu} \equiv 0Fμν,vμ0 in electromagnetism.
Thus T μ ν , ν = 0 T μ ν , ν = 0 T^(mu nu)_(,nu)=0T^{\mu \nu}{ }_{, \nu}=0Tμν,ν=0 is required of the sources, just as is J μ , μ = 0 J μ , μ = 0 J^(mu)_(,mu)=0J^{\mu}{ }_{, \mu}=0Jμ,μ=0 in electromagnetism (exercise 3.16). These identities make the field equations (4') too weak to fix h μ ν h μ ν h_(mu nu)h_{\mu \nu}hμν
completely. In particular, by direct substitution in equations (4), one verifies that to any solution one can add a gauge field
(5) h μ ν (gauge) = ξ μ , ν + ξ ν , μ , h ¯ μ ν (gauge) = ξ μ , ν + ξ ν , μ η μ ν ξ α , α , (5) h μ ν (gauge)  = ξ μ , ν + ξ ν , μ , h ¯ μ ν (gauge)  = ξ μ , ν + ξ ν , μ η μ ν ξ α , α , {:[(5)h_(mu nu)^((gauge) )=xi_(mu,nu)+xi_(nu,mu)","],[ bar(h)_(mu nu)^((gauge) )=xi_(mu,nu)+xi_(nu,mu)-eta_(mu nu)xi^(alpha)_(,alpha)","]:}\begin{align*} & h_{\mu \nu}{ }^{\text {(gauge) }}=\xi_{\mu, \nu}+\xi_{\nu, \mu}, \tag{5}\\ & \bar{h}_{\mu \nu}{ }^{\text {(gauge) }}=\xi_{\mu, \nu}+\xi_{\nu, \mu}-\eta_{\mu \nu} \xi^{\alpha}{ }_{, \alpha}, \end{align*}(5)hμν(gauge) =ξμ,ν+ξν,μ,h¯μν(gauge) =ξμ,ν+ξν,μημνξα,α,
without changing T μ ν T μ ν T^(mu nu)T^{\mu \nu}Tμν.
Let ξ μ ξ μ xi_(mu)\xi_{\mu}ξμ vanish outside some finite spacetime volume, but be otherwise arbitrary. Then h μ ν h μ ν h_(mu nu)h_{\mu \nu}hμν and h μ ν = h μ ν + h μ ν (gauge) h μ ν = h μ ν + h μ ν (gauge)  h_(mu nu)=h_(mu nu)+h_(mu nu)^((gauge) )h_{\mu \nu}=h_{\mu \nu}+h_{\mu \nu}^{\text {(gauge) }}hμν=hμν+hμν(gauge)  both satisfy the source equation (4) for the same source T μ ν T μ ν T^(mu nu)T^{\mu \nu}Tμν and the same boundary conditions at infinity. We therefore expect them to be physically equivalent.
By a specialization of the gauge analogous to the "Lorentz" specialization A α , α = 0 A α , α = 0 A^(alpha)_(,alpha)=0A^{\alpha}{ }_{, \alpha}=0Aα,α=0 of electromagnetism (equation 3.58a; exercise 3.17), one imposes the condition
(6) h ¯ μ α , α = 0 . (6) h ¯ μ α , α = 0 . {:(6) bar(h)^(mu alpha)_(,alpha)=0.:}\begin{equation*} \bar{h}^{\mu \alpha}{ }_{, \alpha}=0 . \tag{6} \end{equation*}(6)h¯μα,α=0.
This reduces the field equations (4) to the simple d'Alembertian form
(7) h ¯ μ ν h ¯ μ ν , α α = 16 π T μ ν (7) h ¯ μ ν h ¯ μ ν , α α = 16 π T μ ν {:(7)◻ bar(h)^(mu nu)-= bar(h)^(mu nu)_(,alpha)^(alpha)=-16 piT^(mu nu):}\begin{equation*} \square \bar{h}^{\mu \nu} \equiv \bar{h}^{\mu \nu}{ }_{, \alpha}^{\alpha}=-16 \pi T^{\mu \nu} \tag{7} \end{equation*}(7)h¯μνh¯μν,αα=16πTμν
(see exercise 18.2). Here and henceforth we set G = 1 G = 1 G=1G=1G=1 ("geometrized units").

C. Field of a Point Mass (exercise 7.3C)

For a static source, the wave equation (7) reduces to a Laplace equation
2 h ¯ μ ν = 16 π T μ ν 2 h ¯ μ ν = 16 π T μ ν grad^(2) bar(h)_(mu nu)=-16 piT_(mu nu)\nabla^{2} \bar{h}_{\mu \nu}=-16 \pi T_{\mu \nu}2h¯μν=16πTμν
The stress-energy tensor for a static point mass (equation 7.10) is T 00 = M δ 3 ( x ) T 00 = M δ 3 ( x ) T^(00)=Mdelta^(3)(x)T^{00}=M \delta^{3}(x)T00=Mδ3(x) and T μ k = 0 T μ k = 0 T^(mu k)=0T^{\mu k}=0Tμk=0. Put this into the Laplace equation, solve for h ¯ μ ν h ¯ μ ν bar(h)_(mu nu)\bar{h}_{\mu \nu}h¯μν, and use equation (7.8c) to obtain h μ ν h μ ν h_(mu nu)h_{\mu \nu}hμν. The result is:
(8) h 00 = 2 M / r ; h 0 k = 0 ; h i k = δ i k ( 2 M / r ) (8) h 00 = 2 M / r ; h 0 k = 0 ; h i k = δ i k ( 2 M / r ) {:(8)h_(00)=2M//r;quadh_(0k)=0;quadh_(ik)=delta_(ik)(2M//r):}\begin{equation*} h_{00}=2 M / r ; \quad h_{0 k}=0 ; \quad h_{i k}=\delta_{i k}(2 M / r) \tag{8} \end{equation*}(8)h00=2M/r;h0k=0;hik=δik(2M/r)
(see equation 18.15a).

D. Perihelion Precession (exercise 7.3D)

Direct substitution of the potential (8) into the equations of motion (2) is tedious and not very instructive. Variational principles are popular in mechanics because they simplify such calculations. Return to the basic variational principle δ I p + i = 0 δ I p + i = 0 deltaI_(p+i)=0\delta I_{p+i}=0δIp+i=0 (equation 1), and insert the potential (8) for the sun. Convert to spherical coordinates so oriented that the orbit lies in the equatorial ( θ = π / 2 ) ( θ = π / 2 ) (theta=pi//2)(\theta=\pi / 2)(θ=π/2) plane:
(9) I p + i = L d τ (10) L = 1 2 m [ ( 1 2 M r 1 ) t ˙ 2 + ( 1 + 2 M r 1 ) ( r ˙ 2 + r 2 ϕ 2 ) ] (9) I p + i = L d τ (10) L = 1 2 m 1 2 M r 1 t ˙ 2 + 1 + 2 M r 1 r ˙ 2 + r 2 ϕ 2 {:[(9)I_(p+i)=int Ld tau],[(10)L=(1)/(2)m[-(1-2Mr^(-1))t^(˙)^(2)+(1+2Mr^(-1))(r^(˙)^(2)+r^(2)phi^(2))]]:}\begin{align*} & I_{p+i}=\int L d \tau \tag{9}\\ & L=\frac{1}{2} m\left[-\left(1-2 M r^{-1}\right) \dot{t}^{2}+\left(1+2 M r^{-1}\right)\left(\dot{r}^{2}+r^{2} \phi^{2}\right)\right] \tag{10} \end{align*}(9)Ip+i=Ldτ(10)L=12m[(12Mr1)t˙2+(1+2Mr1)(r˙2+r2ϕ2)]

Box 7.1 (continued)

From the absence of explicit t t ttt-, ϕ ϕ phi\phiϕ-, and τ τ tau\tauτ-dependence in L L LLL, infer three constants of motion: the canonical momenta
P t m γ = L / i ˙ P t m γ = L / i ˙ P_(t)-=-m gamma=del L//deli^(˙)P_{t} \equiv-m \gamma=\partial L / \partial \dot{i}Ptmγ=L/i˙
(this defines γ γ gamma\gammaγ ) and
P ϕ m α = L / ϕ ˙ P ϕ m α = L / ϕ ˙ P_(phi)-=m alpha=del L//delphi^(˙)P_{\phi} \equiv m \alpha=\partial L / \partial \dot{\phi}Pϕmα=L/ϕ˙
(this defines α α alpha\alphaα ); and the Hamiltonian
H = x ˙ μ ( L / x ˙ μ ) L , H = x ˙ μ L / x ˙ μ L , H=x^(˙)^(mu)(del L//delx^(˙)^(mu))-L,H=\dot{x}^{\mu}\left(\partial L / \partial \dot{x}^{\mu}\right)-L,H=x˙μ(L/x˙μ)L,
which can be set equal to 1 2 m 1 2 m -(1)/(2)m-\frac{1}{2} m12m by appropriate normalization of the path parameter τ τ tau\tauτ. From these constants of the motion, derive an orbit equation as follows: (1) calculate H = 1 2 m H = 1 2 m H=-(1)/(2)mH=-\frac{1}{2} mH=12m in terms of r , r ˙ , ϕ ˙ r , r ˙ , ϕ ˙ r,r^(˙),phi^(˙)r, \dot{r}, \dot{\phi}r,r˙,ϕ˙, and i ˙ i ˙ i^(˙)\dot{i}i˙; (2) eliminate i ˙ i ˙ i^(˙)\dot{i}i˙ and ϕ ˙ ϕ ˙ phi^(˙)\dot{\phi}ϕ˙ in favor of the constants γ γ gamma\gammaγ and α α alpha\alphaα; (3) as in Newtonian orbit problems, define u = M / r u = M / r u=M//ru=M / ru=M/r, and write
d u d ϕ ˙ = u ˙ ϕ ˙ = M r ˙ r 2 ϕ ˙ = M α ( 1 + 2 u ) r ˙ ; d u d ϕ ˙ = u ˙ ϕ ˙ = M r ˙ r 2 ϕ ˙ = M α ( 1 + 2 u ) r ˙ ; (du)/(d(phi^(˙)))=((u^(˙)))/((phi^(˙)))=-(M(r^(˙)))/(r^(2)(phi^(˙)))=-(M)/( alpha)(1+2u)r^(˙);\frac{d u}{d \dot{\phi}}=\frac{\dot{u}}{\dot{\phi}}=-\frac{M \dot{r}}{r^{2} \dot{\phi}}=-\frac{M}{\alpha}(1+2 u) \dot{r} ;dudϕ˙=u˙ϕ˙=Mr˙r2ϕ˙=Mα(1+2u)r˙;
(4) in H H HHH, eliminate r ˙ r ˙ r^(˙)\dot{r}r˙ in favor of d u / d ϕ ˙ d u / d ϕ ˙ du//dphi^(˙)d u / d \dot{\phi}du/dϕ˙ via the above equation, and eliminate r r rrr in favor of u u uuu; (5) solve for d u / d ϕ d u / d ϕ du//d phid u / d \phidu/dϕ. The result is
(11) ( d u d ϕ ) 2 + u 2 = ( γ 2 1 + 2 u ) M 2 α 2 [ 1 + 2 u 1 2 u ] (11) d u d ϕ 2 + u 2 = γ 2 1 + 2 u M 2 α 2 1 + 2 u 1 2 u {:(11)((du)/(d phi))^(2)+u^(2)=(gamma^(2)-1+2u)(M^(2))/(alpha^(2))[(1+2u)/(1-2u)]:}\begin{equation*} \left(\frac{d u}{d \phi}\right)^{2}+u^{2}=\left(\gamma^{2}-1+2 u\right) \frac{M^{2}}{\alpha^{2}}\left[\frac{1+2 u}{1-2 u}\right] \tag{11} \end{equation*}(11)(dudϕ)2+u2=(γ21+2u)M2α2[1+2u12u]
Neglecting cubic and higher powers of u = G M / c 2 r ( 1 γ 2 ) u = G M / c 2 r 1 γ 2 u=GM//c^(2)r∼(1-gamma^(2))u=G M / c^{2} r \sim\left(1-\gamma^{2}\right)u=GM/c2r(1γ2) in this equation, derive the perihelion shift. (For details of method, see exercise 40.4, with the γ γ gamma\gammaγ and α α alpha\alphaα of this box renamed E ~ E ~ widetilde(E)\widetilde{E}E~ and L ~ L ~ widetilde(L)\widetilde{L}L~, and with the γ γ gamma\gammaγ and β β beta\betaβ of that exercise set equal to 1 and 0 .) The resulting shift per orbit is
(12) Δ ϕ = 8 π M / r 0 + O ( [ M / r 0 ] 2 ) . (12) Δ ϕ = 8 π M / r 0 + O M / r 0 2 . {:(12)Delta phi=8pi M//r_(0)+O([M//r_(0)]^(2)).:}\begin{equation*} \Delta \phi=8 \pi M / r_{0}+O\left(\left[M / r_{0}\right]^{2}\right) . \tag{12} \end{equation*}(12)Δϕ=8πM/r0+O([M/r0]2).
This is 4 3 4 3 (4)/(3)\frac{4}{3}43 the prediction of general relativity; and it disagrees with the observations on Mercury (see Box 40.3).

E. Bending of Light (exercise 7.3E)

The deflection angle for light passing the sun is, on dimensional grounds, a small quantity, Δ ϕ M / R 10 6 Δ ϕ M / R 10 6 Delta phi∼M_(o.)//R_(o.)∼10^(-6)\Delta \phi \sim M_{\odot} / R_{\odot} \sim 10^{-6}ΔϕM/R106; from the outset, one makes approximations based on this smallness. A diagram of the photon trajectory, set in the x , z x , z x,zx, zx,z-plane, shows that, for initial motion parallel to the z z zzz-axis, the deflection angle can be expressed in terms of the final momentum as Δ ϕ = p x / p z Δ ϕ = p x / p z Delta phi=p_(x)//p_(z)\Delta \phi=p_{x} / p_{z}Δϕ=px/pz. Compute the final p x p x p_(x)p_{x}px by an integral along the trajectory,
p x = + ( d p x / d z ) d z p x = + d p x / d z d z p_(x)=int_(-oo)^(+oo)(dp_(x)//dz)dzp_{x}=\int_{-\infty}^{+\infty}\left(d p_{x} / d z\right) d zpx=+(dpx/dz)dz

treating p z p z p_(z)p_{z}pz as essentially constant. This computation requires generalization of the equation of motion (2) to the case of zero rest mass. To handle the limit m 0 m 0 m longrightarrow0m \longrightarrow 0m0, introduce a new parameter λ = τ / m λ = τ / m lambda=tau//m\lambda=\tau / mλ=τ/m; then p μ = m ( d z μ / d τ ) = d z μ / d λ p μ = m d z μ / d τ = d z μ / d λ p^(mu)=m(dz^(mu)//d tau)=dz^(mu)//d lambdap^{\mu}=m\left(d z^{\mu} / d \tau\right)=d z^{\mu} / d \lambdapμ=m(dzμ/dτ)=dzμ/dλ. Also define p μ = ( η μ ν + h μ ν ) p ν p μ = η μ ν + h μ ν p ν p_(mu)=(eta_(mu nu)+h_(mu nu))p^(nu)p_{\mu}=\left(\eta_{\mu \nu}+h_{\mu \nu}\right) p^{\nu}pμ=(ημν+hμν)pν, since this quantity appears simply in equation (2) and agrees with p μ p μ p_(mu)p_{\mu}pμ in the limit r r r longrightarrow oor \longrightarrow \inftyr, where one will need to evaluate it. Then equation (2) reads, for any m m mmm, including m = 0 m = 0 m=0m=0m=0,
d P μ d λ = 1 2 h α β , μ p α p β d P μ d λ = 1 2 h α β , μ p α p β (dP_(mu))/(d lambda)=(1)/(2)h_(alpha beta,mu)p^(alpha)p^(beta)\frac{d P_{\mu}}{d \lambda}=\frac{1}{2} h_{\alpha \beta, \mu} p^{\alpha} p^{\beta}dPμdλ=12hαβ,μpαpβ
On the righthand side here, since h α β , μ h α β , μ h_(alpha beta,mu)h_{\alpha \beta, \mu}hαβ,μ is small, a crude approximation to p μ p μ p^(mu)p^{\mu}pμ suffices; p 1 = p 2 = 0 , p 0 = p 3 = d z / d λ = ω = p 1 = p 2 = 0 , p 0 = p 3 = d z / d λ = ω = p^(1)=p^(2)=0,p^(0)=p^(3)=dz//d lambda=omega=p^{1}=p^{2}=0, p^{0}=p^{3}=d z / d \lambda=\omega=p1=p2=0,p0=p3=dz/dλ=ω= constant. Thus,
d P 1 d λ = 1 2 ( h 00 + 2 h 03 + h 33 ) 1 ω 2 d P 1 d λ = 1 2 h 00 + 2 h 03 + h 33 1 ω 2 (dP_(1))/(d lambda)=(1)/(2)(h_(00)+2h_(03)+h_(33))_(1)omega^(2)\frac{d P_{1}}{d \lambda}=\frac{1}{2}\left(h_{00}+2 h_{03}+h_{33}\right)_{1} \omega^{2}dP1dλ=12(h00+2h03+h33)1ω2
and
1 p 3 d P 1 d z = 1 2 ( h 00 + 2 h 03 + h 33 ) , 1 1 p 3 d P 1 d z = 1 2 h 00 + 2 h 03 + h 33 , 1 (1)/(p_(3))(dP_(1))/(dz)=(1)/(2)(h_(00)+2h_(03)+h_(33))_(,1)\frac{1}{p_{3}} \frac{d P_{1}}{d z}=\frac{1}{2}\left(h_{00}+2 h_{03}+h_{33}\right)_{, 1}1p3dP1dz=12(h00+2h03+h33),1
For the sun, h 00 = h 33 = 2 M / r h 00 = h 33 = 2 M / r h_(00)=h_(33)=2M//rh_{00}=h_{33}=2 M / rh00=h33=2M/r, and h 03 = 0 h 03 = 0 h_(03)=0h_{03}=0h03=0 (equation 8 ), so
(13) Δ ϕ = ( p 1 p 3 ) final = ( P 1 p 3 ) final = 2 M d z ( 2 + z 2 ) 3 / 2 = 2 M d ζ ( 1 + ζ 2 ) 3 / 2 = 4 M (13) Δ ϕ = p 1 p 3 final  = P 1 p 3 final  = 2 M d z 2 + z 2 3 / 2 = 2 M d ζ 1 + ζ 2 3 / 2 = 4 M {:(13)Delta phi=-((p_(1))/(p_(3)))_("final ")=-((P_(1))/(p_(3)))_("final ")=int_(-oo)^(oo)(2Mℓdz)/((ℓ^(2)+z^(2))^(3//2))=(2M)/(ℓ)int_(-oo)^(oo)(d zeta)/((1+zeta^(2))^(3//2))=(4M)/(ℓ):}\begin{equation*} \Delta \phi=-\left(\frac{p_{1}}{p_{3}}\right)_{\text {final }}=-\left(\frac{P_{1}}{p_{3}}\right)_{\text {final }}=\int_{-\infty}^{\infty} \frac{2 M \ell d z}{\left(\ell^{2}+z^{2}\right)^{3 / 2}}=\frac{2 M}{\ell} \int_{-\infty}^{\infty} \frac{d \zeta}{\left(1+\zeta^{2}\right)^{3 / 2}}=\frac{4 M}{\ell} \tag{13} \end{equation*}(13)Δϕ=(p1p3)final =(P1p3)final =2Mdz(2+z2)3/2=2Mdζ(1+ζ2)3/2=4M
For light grazing the sun, = R = R ℓ=R_(o.)\ell=R_{\odot}=R, this gives Δ ϕ = 4 M / R Δ ϕ = 4 M / R Delta phi=4M_(o.)//R_(o.)\Delta \phi=4 M_{\odot} / R_{\odot}Δϕ=4M/R radians = 1 .75 = 1 .75 =1^('').75=1^{\prime \prime} .75=1.75, which is also the prediction of general relativity, and is consistent with the observations (see Box 40.1).

F. Gravitational Waves (exercise 7.3F)

The field equations (4) and gauge properties (5) of the present flat-spacetime theory are identical to those of Einstein's "linearized theory." Thus, the treatment of gravitational waves using linearized theory, which is presented in $ $ 18.2 , 35.3 $ $ 18.2 , 35.3 $$18.2,35.3\$ \$ 18.2,35.3$$18.2,35.3, and 35.4, applies here.

G. Positive Energy of the Waves (exercise 7.3G)

Computing a general formula for K K K\mathscr{K}K from equation (7.17) is tedious, but it is sufficient to consider only the special case of a plane wave (equation 7.13)-or better still,
Box 7.1 (continued)
a plane wave with only h 12 = h 21 = f ( z t ) h 12 = h 21 = f ( z t ) h_(12)=h_(21)=f(z-t)h_{12}=h_{21}=f(z-t)h12=h21=f(zt). Any gravitational wave can be constructed as a superposition of such plane waves. First compute the Langrangian for this case. According to equation (7.8), it reads
E f = ( 32 π ) 1 [ ( h 12 , 0 ) 2 ( h 12 , 3 ) 2 ] . E f = ( 32 π ) 1 h 12 , 0 2 h 12 , 3 2 . E_(f)=(32 pi)^(-1)[(h_(12,0))^(2)-(h_(12,3))^(2)].\mathcal{E}_{f}=(32 \pi)^{-1}\left[\left(h_{12,0}\right)^{2}-\left(h_{12,3}\right)^{2}\right] .Ef=(32π)1[(h12,0)2(h12,3)2].
Now the full content of the formula (7.17) defining K K K\mathscr{K}K is precisely the following: start from the Lagrangian; keep all terms that are quadratic in time derivatives; omit all terms that are linear in time derivatives; and reverse the sign of terms that contain no time derivatives. The result is
(14) K = ( 32 π ) 1 [ ( h 12 , 0 ) 2 + ( h 12 , 3 ) 2 ] , (14) K = ( 32 π ) 1 h 12 , 0 2 + h 12 , 3 2 , {:(14)K=(32 pi)^(-1)[(h_(12,0))^(2)+(h_(12,3))^(2)]",":}\begin{equation*} \mathscr{K}=(32 \pi)^{-1}\left[\left(h_{12,0}\right)^{2}+\left(h_{12,3}\right)^{2}\right], \tag{14} \end{equation*}(14)K=(32π)1[(h12,0)2+(h12,3)2],
which is positive.

H. Self-Inconsistency of the Theory (exercise 7.3H)

From equation (7.10), find
T μ ν , ν = m d z μ d τ d z ν d τ x ν δ 4 [ x z ( τ ) ] d τ . T μ ν , ν = m d z μ d τ d z ν d τ x ν δ 4 [ x z ( τ ) ] d τ . T^(mu nu)_(,nu)=m int(dz^(mu))/(d tau)(dz^(nu))/(d tau)(del)/(delx^(nu))delta^(4)[x-z(tau)]d tau.T^{\mu \nu}{ }_{, \nu}=m \int \frac{d z^{\mu}}{d \tau} \frac{d z^{\nu}}{d \tau} \frac{\partial}{\partial x^{\nu}} \delta^{4}[\boldsymbol{x}-\boldsymbol{z}(\tau)] d \tau .Tμν,ν=mdzμdτdzνdτxνδ4[xz(τ)]dτ.
But δ 4 ( x z ) δ 4 ( x z ) delta^(4)(x-z)\delta^{4}(\boldsymbol{x}-\boldsymbol{z})δ4(xz) depends only on the difference x μ z μ x μ z μ x^(mu)-z^(mu)x^{\mu}-z^{\mu}xμzμ, so / z ν / z ν -del//delz^(nu)-\partial / \partial z^{\nu}/zν can replace / x ν / x ν del//delx^(nu)\partial / \partial x^{\nu}/xν when acting on the δ δ delta\deltaδ-function. Noting that
d z v d τ z ν δ 4 [ x z ( τ ) ] = d d τ δ 4 [ x z ( τ ) ] , d z v d τ z ν δ 4 [ x z ( τ ) ] = d d τ δ 4 [ x z ( τ ) ] , (dz^(v))/(d tau)(del)/(delz^(nu))delta^(4)[x-z(tau)]=(d)/(d tau)delta^(4)[x-z(tau)],\frac{d z^{v}}{d \tau} \frac{\partial}{\partial z^{\nu}} \delta^{4}[\boldsymbol{x}-\boldsymbol{z}(\tau)]=\frac{d}{d \tau} \delta^{4}[\boldsymbol{x}-\boldsymbol{z}(\tau)],dzvdτzνδ4[xz(τ)]=ddτδ4[xz(τ)],
rewrite T μ ν , v T μ ν , v T^(mu nu)_(,v)T^{\mu \nu}{ }_{, v}Tμν,v as
T μ ν , ν = m z ˙ μ ( d / d τ ) δ 4 [ x z ( τ ) ] d τ = + m z ¨ μ δ 4 [ x z ( τ ) ] d τ T μ ν , ν = m z ˙ μ ( d / d τ ) δ 4 [ x z ( τ ) ] d τ = + m z ¨ μ δ 4 [ x z ( τ ) ] d τ T^(mu nu)_(,nu)=-m intz^(˙)^(mu)(d//d tau)delta^(4)[x-z(tau)]d tau=+m intz^(¨)^(mu)delta^(4)[x-z(tau)]d tauT^{\mu \nu}{ }_{, \nu}=-m \int \dot{z}^{\mu}(d / d \tau) \delta^{4}[\boldsymbol{x}-\boldsymbol{z}(\tau)] d \tau=+m \int \ddot{z}^{\mu} \delta^{4}[\boldsymbol{x}-\boldsymbol{z}(\tau)] d \tauTμν,ν=mz˙μ(d/dτ)δ4[xz(τ)]dτ=+mz¨μδ4[xz(τ)]dτ
(The last step is obtained by an integration by parts.) Thus T μ ν , ν = 0 T μ ν , ν = 0 T^(mu nu)_(,nu)=0T^{\mu \nu}{ }_{, \nu}=0Tμν,ν=0 holds if and only if z ¨ μ = 0 z ¨ μ = 0 z^(¨)^(mu)=0\ddot{z}^{\mu}=0z¨μ=0. But z ¨ μ = 0 z ¨ μ = 0 z^(¨)^(mu)=0\ddot{z}^{\mu}=0z¨μ=0 means that the gravitational fields have no effect on the motion of the particle. But this contradicts the equation of motion (2), which follows from the theory's variational principle. Thus, this tensor theory of gravity is inconsistent. [Stated briefly, equation (4) requires T μ v , ν = 0 T μ v , ν = 0 T^(mu v)_(,nu)=0T^{\mu v}{ }_{, \nu}=0Tμv,ν=0, while equation (2) excludes it.]
The fact that, in this theory, gravitating bodies cannot be affected by gravity, also holds for bodies made of arbitrary stress-energy (e.g., rubber balls or the Earth). Since all bodies gravitate, since the field equations imply T μ ν , ν = 0 T μ ν , ν = 0 T^(mu nu)_(,nu)=0T^{\mu \nu}{ }_{, \nu}=0Tμν,ν=0, and since this "equation of motion for stress-energy" implies conservation of a body's total 4-momentum P μ = T μ 0 d 3 x P μ = T μ 0 d 3 x P^(mu)=intT^(mu0)d^(3)xP^{\mu}=\int T^{\mu 0} d^{3} xPμ=Tμ0d3x, no body can be accelerated by gravity. The Earth cannot be attracted by the sun; it must fly off into interstellar space!
Straightforward steps to repair this inconsistency in the theory lead inexorably to general relativity (see Box 17.2 part 5). Having adopted general relativity as the theory of gravity, one can then use the present flat-spacetime theory as an approximation to it ("Linearized general relativity"; treated in Chapters 18, 19, and 35 ; see especially discussion at end of §18.3).

§7.2. GRAVITATIONAL RED SHIFT DERIVED FROM ENERGY CONSERVATION

Einstein argued against the existence of any ideal, straight-line reference system such as is assumed in Newtonian theory. He emphasized that nothing in a natural state of motion, not even a photon, could ever give evidence for the existence or location of such ideal straight lines.
That a photon must be affected by a gravitational field Einstein (1911) showed from the law of conservation of energy, applied in the context of Newtonian gravitation theory. Let a particle of rest mass m m mmm start from rest in a gravitational field g g ggg at point C C C\mathscr{C}C and fall freely for a distance h h hhh to point G G G\mathscr{G}G. It gains kinetic energy m g h m g h mghm g hmgh. Its total energy, including rest mass, becomes
(7.18) m + m g h . (7.18) m + m g h . {:(7.18)m+mgh.:}\begin{equation*} m+m g h . \tag{7.18} \end{equation*}(7.18)m+mgh.
Now let the particle undergo an annihilation at B B B\mathscr{B}B, converting its total rest mass plus kinetic energy into a photon of the same total energy. Let this photon travel upward in the gravitational field to a a aaa. If it does not interact with gravity, it will have its original energy on arrival at a a aaa. At this point it could be converted by a suitable apparatus into another particle of rest mass m m mmm (which could then repeat the whole process) plus an excess energy m g h m g h mghm g hmgh that costs nothing to produce. To avoid this contradiction of the principal of conservation of energy, which can also be stated in purely classical terms, Einstein saw that the photon must suffer a red shift. The energy of the photon must decrease just as that of a particle does when it climbs out of the gravitational field. The photon energy at the top and the bottom of its path through the gravitational field must therefore be related by
(7.19) E bottom = E top ( 1 + g h ) = E top ( 1 + g conv h / c 2 ) (7.19) E bottom = E top ( 1 + g h ) = E top 1 + g conv h / c 2 {:(7.19)E_(bottom)=E_(top)(1+gh)=E_(top)(1+g_(conv)h//c^(2)):}\begin{equation*} E_{\mathrm{bottom}}=E_{\mathrm{top}}(1+g h)=E_{\mathrm{top}}\left(1+g_{\mathrm{conv}} h / c^{2}\right) \tag{7.19} \end{equation*}(7.19)Ebottom=Etop(1+gh)=Etop(1+gconvh/c2)
The drop in energy because of work done against gravitation implies a drop in frequency and an increase in wavelength (red shift; traditionally stated in terms of a red shift parameter, z = Δ λ / λ z = Δ λ / λ z=Delta lambda//lambdaz=\Delta \lambda / \lambdaz=Δλ/λ ); thus,
(7.20) 1 + z = λ top λ bottom = h ν bottom h v top = E bottom E top = 1 + g h . (7.20) 1 + z = λ top  λ bottom  = h ν bottom  h v top  = E bottom  E top  = 1 + g h . {:(7.20)1+z=(lambda_("top "))/(lambda_("bottom "))=(hnu_("bottom "))/(hv_("top "))=(E_("bottom "))/(E_("top "))=1+gh.:}\begin{equation*} 1+z=\frac{\lambda_{\text {top }}}{\lambda_{\text {bottom }}}=\frac{h \nu_{\text {bottom }}}{h v_{\text {top }}}=\frac{E_{\text {bottom }}}{E_{\text {top }}}=1+g h . \tag{7.20} \end{equation*}(7.20)1+z=λtop λbottom =hνbottom hvtop =Ebottom Etop =1+gh.
The redshift predicted by this formula has been verified to 1 percent by Pound and Snider (1964, 1965), refining an experiment by Pound and Rebka (1960).

§7.3. GRAVITATIONAL REDSHIFT IMPLIES SPACETIME IS CURVED

An argument by Schild ( 1960 , 1962 , 1967 ) ( 1960 , 1962 , 1967 ) (1960,1962,1967)(1960,1962,1967)(1960,1962,1967) yields an important conclusion: the existence of the gravitational redshift shows that a consistent theory of gravity cannot be constructed within the framework of special relativity.
Gravitational redshift derived from energy considerations
Figure 7.1.
Successive pulses of light rising from height z 1 z 1 z_(1)z_{1}z1, to height z 2 = z 1 + h z 2 = z 1 + h z_(2)=z_(1)+hz_{2}=z_{1}+hz2=z1+h against the gravitational field of the earth. The paths γ 1 γ 1 gamma_(1)\gamma_{1}γ1, and γ 2 γ 2 gamma_(2)\gamma_{2}γ2 must be exactly congruent, whether sloped at 45 45 45^(@)45^{\circ}45 (left) or having variable slope (right).
Assume gravity is described by an (unspecified) field in flat spacetime . . .
Whereas Einstein's argument (last section) was formulated in Newtonian theory, Schild's is formulated in special relativity. It analyzes gravitational redshift experiments in the field of the Earth, using a global Lorentz frame tied to the Earth's center. It makes no demand that free particles initially at rest remain at rest in this global Lorentz frame (except far from the Earth, where gravity is negligible). On the contrary, it demands that free particles be accelerated relative to the Lorentz frame by the Earth's gravitational field. It is indifferent to the mathematical nature of that field (scalar, vector, tensor, ...), but it does insist that the gravitational accelerations agree with experiment. And, of course, it demands that proper lengths and times be governed by the metric of special relativity.
Schild's argument proceeds as follows. Consider one observer at rest on the Earth's surface at height z 1 z 1 z_(1)z_{1}z1, and a second above the Earth's surface at height z 2 = z 1 + h z 2 = z 1 + h z_(2)=z_(1)+hz_{2}=z_{1}+hz2=z1+h (Figure 7.1). The observers may verify that they are at rest relative to each other and relative to the Earth's Lorentz frame by, for instance, radar ranging to free particles that are at rest in the Earth's frame far outside its gravitational field. The bottom experimenter then emits an electromagnetic signal of a fixed standard frequency ω b ω b omega_(b)\omega_{b}ωb which is received by the observer on top. For definiteness, let the signal be a pulse exactly N N NNN cycles long. Then the interval of time* δ τ bot δ τ bot  deltatau_("bot ")\delta \tau_{\text {bot }}δτbot  required for the emission of the pulse is given by 2 π N = ω b δ τ bot 2 π N = ω b δ τ bot  2pi N=omega_(b)deltatau_("bot ")2 \pi N=\omega_{b} \delta \tau_{\text {bot }}2πN=ωbδτbot . The observer at the top is then to receive these same N N NNN cycles of the electromagnetic wave pulse and measure the time interval* δ τ top δ τ top  deltatau_("top ")\delta \tau_{\text {top }}δτtop  required. By the definition of "frequency," it satisfies 2 π N = 2 π N = 2pi N=2 \pi N=2πN= ω t δ τ top ω t δ τ top  omega_(t)deltatau_("top ")\omega_{t} \delta \tau_{\text {top }}ωtδτtop . The redshift effect, established by experiment (for us) or by energy conservation (for Einstein), shows ω t < ω b ω t < ω b omega_(t) < omega_(b)\omega_{t}<\omega_{b}ωt<ωb; consequently the time intervals are different, δ τ top > δ τ bot δ τ top  > δ τ bot  deltatau_("top ") > deltatau_("bot ")\delta \tau_{\text {top }}>\delta \tau_{\text {bot }}δτtop >δτbot . Transfer this information to the special-relativity spacetime diagram of the experiment (Figure 7.1). The waves are light rays; so one might show them as traveling along 45 45 45^(@)45^{\circ}45 null lines in the spacetime diagram (Figure 7.1,A). In this
simplified but slightly inadequate form of the argument, one reaches a contradiction by noticing that here one has drawn a parallelogram in Minkowski spacetime in which two of the sides are unequal, τ top > τ bot τ top  > τ bot  tau_("top ") > tau_("bot ")\tau_{\text {top }}>\tau_{\text {bot }}τtop >τbot , whereas a parallelogram in flat Minkowski spacetime cannot have opposite sides unequal. One concludes that special relativity cannot be valid over any sufficiently extended region. Globally, spacetime, as probed by the tracks of light rays and test particles, departs from flatness ("curvature"; Parts III and IV of this book), despite the fine fit that Lorentz-Minkowski flatness gives to physics locally.
Figure 7.1,B, repairs an oversimplification in this argument by recognizing that the propagation of light will be influenced by the gravitational field. Therefore photons might not follow straight lines in the diagram. Consequently, the world lines γ 1 γ 1 gamma_(1)\gamma_{1}γ1 and γ 2 γ 2 gamma_(2)\gamma_{2}γ2 of successive pulses are curves. However, the gravitational field is static and the experimenters do not move. Therefore nothing in the experimental setup changes with time. Whatever the path γ 1 γ 1 gamma_(1)\gamma_{1}γ1, the path γ 2 γ 2 gamma_(2)\gamma_{2}γ2 must be a congruent path of exactly the same shape, merely translated in time. On the basis of this congruence and the fact that the observers are moving on parallel world lines, one would again conclude, if flat Minkowski geometry were valid, that τ bot = τ top τ bot  = τ top  tau_("bot ")=tau_("top ")\tau_{\text {bot }}=\tau_{\text {top }}τbot =τtop , thus contradicting the observed redshift experiment. The experimenters do not need to understand the propagation of light in a gravitational field. They need only use their radar apparatus to verify the fact that they are at rest relative to each other and relative to the source of the gravitational field. They know that, whatever influence the gravitational field has on their radar apparatus, it will not be a time-dependent influence. Moreover, they do not have to know how to compute their separation in order to verify that the separation remains constant. They only need to verify that the round-trip time required for radar pulses to go out to each other and back is the same every time they measure it.
Schild's redshift argument does not reveal what kind of curvature must exist, or whether the curvature exists in the neighborhood of the observational equipment or some distance away from it. It does say, however, quite unambigously, that the flat spacetime of special relativity is inadequate to describe the situation, and it should therefore motivate one to undertake the mathematical analysis of curvature in Part III.

§7.4. GRAVITATIONAL REDSHIFT AS EVIDENCE FOR THE PRINCIPLE OF EQUIVALENCE

Einstein ( 1908 , 1911 ) ( 1908 , 1911 ) (1908,1911)(1908,1911)(1908,1911) elevated the idea of the universality of gravitational interactions to the status of a fundamental principle of equivalence, that all effects of a uniform gravitational field are identical to the effects of a uniform acceleration of the coordinate system. This principle generalized a result of Newtonian gravitation theory, in which a uniform acceleration of the coordinate system in equation (7.1) gives rises to a
This assumption is incompatible with gravitational redshift
Conclusion: spacetime is curved
Principle of equivalence: a uniform gravitational field is indistinguishable from a uniform acceleration of a reference frame
Gravitational redshift derived from equivalence principle
Equivalence principle implies nonmeshing of local Lorentz frames near Earth (spacetime curvature!)
supplementary uniform gravitational field. However, the Newtonian theory only gives this result for particle mechanics. Einstein's principle of equivalence asserts that a similar correspondence will hold for all the laws of physics, including Maxwell's equations for the electromagnetic field.
The rules of the game - the "scientific method"-require that experimental support be sought for any new theory or principle, and Einstein could treat the gravitational redshift as the equivalent of experimental confirmation of his principle of equivalence. There are two steps in such a confirmation: first, the theory or principle must predict an effect (the next paragraph describes how the equivalence principle implies the redshift); second, the predicted effect must be observed. With the Pound-RebkaSnider experiments, one is in much better shape today than Einstein was for this second step. Einstein himself had to rely on the experiments supporting the general concept of energy conservation, plus the necessity of a redshift to preserve energy conservation, as a substitute for direct experimental confirmation.
The existence of the gravitational redshift can be deduced from the equivalence principle by considering two experimenters in a rocket ship that maintains a constant acceleration g g ggg. Let the distance between the two observers be h h hhh in the direction of the acceleration. Suppose for definiteness that the rocket ship was at rest in some inertial coordinate system when the bottom observer sent off a photon. It will require time t = h t = h t=ht=ht=h for the photon to reach the upper observer. In that time the top observer acquires a velocity v = g t = g h v = g t = g h v=gt=ghv=g t=g hv=gt=gh. He will therefore detect the photon and observe a Doppler redshift z = v = g h z = v = g h z=v=ghz=v=g hz=v=gh. The results here are therefore identical to equation (7.20). The principle of equivalence of course requires that, if this redshift is observed in an experiment performed under conditions of uniform acceleration in the absence of gravitational fields, then the same redshift must be observed by an experiment performed under conditions where there is a uniform gravitational field, but no acceleration. Consequently, by the principle of equivalence, one can derive equation (7.20) as applied to the gravitational situation.

§7.5. LOCAL FLATNESS, GLOBAL CURVATURE

The equivalence principle helps one to discern the nature of the spacetime curvature, whose existence was inferred from Schild's argument. Physics is the same in an accelerated frame as it is in a laboratory tied to the Earth's surface. Thus, an Earth-bound lab can be regarded as accelerating upward, with acceleration g g ggg, relative to the Lorentz frames in its neighborhood.* Equivalently, relative to the lab and the Earth's surface, all Lorentz frames must accelerate downward. But the downward (radial) direction is different at different latitudes and longitudes. Hence, local Lorentz frames, initially at rest with respect to each other but on opposite sides of the Earth, subsequently fall toward the center and go flying through each other. Clearly they cannot be meshed to form the single global Lorentz frame, tied to the
Earth, that was assumed in Schild's argument. This nonmeshing of local Lorentz frames, like the nonmeshing of local Cartesian coordinates on a curved 2 -surface, is a clear manifestation of spacetime curvature.
Geographers have similar problems when mapping the surface of the earth. Over small areas, a township or a county, it is easy to use a standard rectangular coordinate system. However, when two fairly large regions are mapped, each with one coordinate axis pointing north, then one finds that the edges of the maps overlap each other best if placed at a slight angle (spacetime analog: relative velocity of two local Lorentz frames meeting at center of Earth). It is much easier to start from a picture of a spherical globe, and then talk about how small flat maps might be used as good approximations to parts of it, than to start with a huge collection of small maps and try to piece them together to build up a picture of the globe. The exposition of the geometry of spacetime in this book will therefore take the first approach. Now that one recognizes that the problem is to fit together local, flat spacetime descriptions of physics into an over-all view of the universe, one should be happy to jump, in the next chapter, into a broadscale study of geometry. From this more advantageous viewpoint, one can then face the problem of discussing the relationship between the local inertial coordinate systems appropriate to two nearby regions that have slightly different gravitational fields.
There are actually two distinguishable ways in which geometry enters the theory of general relativity. One is the geometry of lengths and angles in four-dimensional spacetime, which is inherited from the metric structure d s 2 d s 2 ds^(2)d s^{2}ds2 of special relativity. Schild's argument already shows (without a direct appeal to the equivalence principle) that the special-relativistic ideas of length and angle must be modified. The modified ideas of metric structure lead to Riemannian geometry, which will be treated in Chapters 8 and 13. However, geometry also enters general relativity because of the equivalence principle. An equivalence principle can already be stated within Newtonian gravitational theory, in which no concepts of a spacetime metric enter, but only the Euclidean metric structure of three-dimensional space. The equivalence-principle view of Newtonian theory again insists that the local standard of reference be the freely falling particles. This requirement leads to the study of a spacetime geometry in which the curved world lines of freely falling particles are defined to be locally straight. They play the role in a curved spacetime geometry that straight lines play in flat spacetime. This "affine geometry" will be studied in Chapters 10-12. It leads to a quantitative formulation of the ideas of "covariant derivative" and "curvature" and even "curvature of Newtonian spacetime"!
Nonmeshing of local Lorentz frames motivates study of geometry
Two types of geometry relevant to spacetime:
Riemannian geometry (lengths and angles)
Affine geometry (" straight lines" and curvature)

PART
III

THE MATHEMATICS OF CURVED SPACETIME

Wherein the reader is exposed to the charms of a new temptress-
Modern Differential Geometry-and makes a decision: to embrace her for eight full chapters; or, having drunk his fill, to escape after one.

    • Proper time equals Lorentz coordinate time for both observers, since they are at rest in the Earth's Lorentz frame.
  1. *This upward acceleration of the laboratory, plus equation (6.18) for the line element in an accelerated coordinate system, explains the nonequality of the bottom and top edges of the parallelograms in Figure 7.1.